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Our common understanding of the physical world deeply relies on the notion that events are 
ordered with respect to some time parameter, with past events serving as causes for future ones. 

Nonetheless, it was recently foimd that it is possible to formulate quantum mechanics without 
any reference to a global time or causal structure. The resulting framework includes new kinds of 
quantum resources that allow performing tasks - in particular, the violation of causal inequalities - 
which are impossible for events ordered according to a global causal order. However, no physical 
implementation of such resources is known. Here we show that a recently demonstrated resource 
for quantum computation - the quantum switch - is a genuine example of "indefinite causal order". 

We do this by introducing a new tool - the causal witness - which can detect the causal nonseparability 
of any quantum resource that is incompatible with a definite causal order. We show however that 
the quantum switch does not violate any causal inequality. 


I. INTRODUCTION 

It is commonly assumed that information is pro¬ 
cessed through a series of operations which are per¬ 
formed according to a specific order. This is justified 
by the assumption of a global, underlying time pa¬ 
rameter according to which all operations can be or¬ 
dered. A convenient representation of this structure 
is that of a circuit [1], Fig. i(a), in which systems are 
"wires" that connect "boxes", which represent opera¬ 
tions performed on the systems. At a more abstract 
level, a circuit only imposes a given causal structure 
between operations, as the time order between oper¬ 
ations that can be performed in parallel is irrelevant. 
The circuit framework is also ubiquitous in the study 
of quantum formdations to formalize generalized, pos¬ 
sibly post-quantum, probabilistic theories [2-5]. 

It has been suggested that such a framework might 
be too restrictive to encompass the most general 
kinds of information processing allowed by quantum 
physics [6]. For example, one can consider protocols 
in which the order between different operations is con¬ 
trolled by a quantum degree of freedom. It has been 
shown that such protocols exploiting a so-called "quan¬ 
tum switch" not only provide computational advan¬ 
tage over standard, time-ordered, ones [7, 8], but they 
are also physically realizable and a first experimental 
proof-of-principle has been recently demonstrated [9]. 
At a more fundamental level, an underlying time or 
causal order might not be well-defined in a theory that 
combines the dynamical causal structure of general rel¬ 
ativity and the probabilistic nature of quantum mechan¬ 


ics [10-12]. 

It is therefore natural to ask what the most general re¬ 
sources allowed by quantum mechanics beyond the cir¬ 
cuit model are. In Ref. [13] the process matrix formalism 
was proposed as a general framework to describe re¬ 
sources that can be accessed in "local laboratories" and 
which are locally in agreement with quantum physics. 
Fig. i(b). 

Causal relations are defined operationally in this for¬ 
malism. If, for example, through appropriate state 
preparations, an agent A can influence the outcomes 
of measurements performed by an agent B, whereas B 
is never able to influence A, then A causally precedes 
B by definition and, in this case, the physical resources 
available to them can in fact be represented as a circuit. 
A first example of a resource that cannot be represented 
as a circuit is a probabilistic mixture of circuits: a defi¬ 
nite order still exists between A and B in each run of 
an experiment, but which order is realized in a given 
run is only specified according to some probability dis¬ 
tribution. Resources compatible with a definite causal 
order, in this broader sense, are called causally separa¬ 
ble. Surprisingly, the formalism also allows for causally 
nonseparable resources, which are incompatible with any 
definite order between operations. It was found that 
a set of agents with access to a specific causally non¬ 
separable resource could perform a task, the violation 
of a causal inequality, which is impossible for arbitrary 
causally ordered strategies, even allowing probabilistic 
mixtures of orders [13]. However, there is no physical 
interpretation for such resources and no physically re¬ 
alizable protocol is known which can violate a causal 
inequality. 
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(a) (b) 


Figure i. (a) If the operations Mi of local agents are per¬ 
formed in a definite causal sequence, they can be represented 
as gates in a circuit, where information flows from bottom to 
top. (b) A process matrix formalizes a resource in which the 
order between operations may not be fixed. A probabilistic 
mixture of different orders is an example of a process ma¬ 
trix that does not correspond to a circuit. Still, in this case 
operations are performed in a well-defined order in each ex¬ 
perimental run; the most general resource with this property 
is called causally separable. The process matrix formalism also 
allows for the more general case of causally nonseparable re¬ 
sources [13]. 

It is therefore not completely clear what is the pre¬ 
cise relation between "quantum correlations with no 
causal order", which violate causal inequalities, and 
physically implementable resources, such as the quan¬ 
tum switch, which outperform causally ordered ones. 
To understand this relation, a crucial observation is that 
the causal inequalities are device-independent constraints: 
they are formulated independently of the physics of fhe 
systems or the specific apparatuses employed. On the 
other hand, the tasks discussed in Refs. [7, 8] include 
additional assumptions, as for example that in each lab¬ 
oratory quantum systems of a definife dimension have 
to be used. It is clear that, given additional restrictions, 
it is more difficult for causally-ordered agents to per¬ 
form certain tasks and, consequently, it can be easier to 
detect the lack of causal order in a physical resource. 

The aim of fhe presenf work is fo develop a general 
framework for the device-dependent detection of causal 
nonseparability. The central tool we introduce is what 
we call a causal witness, which represents a set of quan- 
fum operations, such as unitaries, channels, state prepa¬ 
rations, and measurements, whose expectation value 
is non-negative as long as all the operations are per¬ 
formed in a definife causal order, i.e., as long as only 
causally separable resources are used. The observation 
of a negative expecfation value is fhus sufficient to con¬ 
clude that the operations were not performed in a def¬ 
inife order. The concept is analogous to that of enfan- 


glement wifness: an observable thaf has a non-negative 
expectation value for separable sfates buf can have a 
negative expectation value for specific entangled states. 

We find fhat, for every causally nonseparable process, 
if is possible fo construcf a causal wifness fhat detects 
it. Importantly, and differently from the case of entan¬ 
glement witnesses, it is possible to use this method to 
write necessary and sufficient conditions for causal sep¬ 
arability in a form that can be checked efficiently using 
semidefinite programming (SDP). 

The tools developed are applied to the study of the 
quantum switch as a resource within the process ma¬ 
trix formalism. We show fhat, indeed, the quantum 
switch corresponds to a causally nonseparable process. 
We show that the protocol of Ref. [7] can be reformu¬ 
lated as a causal witness which detects the causal non¬ 
separability of the quantum switch. We also find new, 
more efficienf witnesses, which could be useful for ex¬ 
perimental implementations. 

We finally address the question of whefher the quan¬ 
tum switch can pass any device-independent test of 
causal nonseparability. As it turns out, this is not possi¬ 
ble: we prove that a broad class of resources, including 
fhe quanfum switch, cannot violate any causal inequal¬ 
ity. 

The paper is organized as follows: In Section II, we 
review the process matrix formalism, giving a conve¬ 
nient characterization of general and causally separa¬ 
ble process matrices for fhe cases of interest. In Sec¬ 
tion III, we introduce and characterize the central con¬ 
cept of causal wifness, and we present efficient algo¬ 
rithms for finding witnesses and for proving the causal 
(non)separability of a general process matrix. In Sec¬ 
tion IV we formalise the quantum switch as a process 
matrix. We proceed to prove its causal nonseparability 
in Section V, through the use of causal witnesses. One 
such witness is the task proposed in Ref. [7], thaf we op¬ 
timize fo increase its resistance to noise. Finally, we clar¬ 
ify in Section VI the link between causal witnesses and 
causal inequalities and show that the quantum switch 
cannot violate any causal inequality. 


II. THE PROCESS MATRIX EORMALISM 

In the general scenario we consider in this paper, N 
parties establish correlations by exchanging physical 
systems between their laboratories. Each party opens 
their laboratory only once to let an incoming system 
enter and to send an outgoing system out; they can 
act on these systems by performing an arbitrary opera¬ 
tion in their local laboratory, which can yield different 
measurement outcomes. The causal relations between 
the parties (i.e., the ordering of evenfs) are not a priori 
specified. The most general situation compatible with 
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the assumption that the operations performed in each lo¬ 
cal laboratory can he described by the quantum formalism 
can be conveniently represented in the "process ma¬ 
trix" formalism introduced in Ref. [13]. This extends the 
"comb" formalism of Ref. [14], which describes causally 
ordered quantum networks. The aim of the formalism 
is to characterize all possible probability distributions 
that can be obtained in our general scenario. The key 
concept is that of a process, which can be understood as 
the external resource determining the statistics of the lo¬ 
cal operations, and which generalizes both the notions 
of quantum state and of quantum charmel. The process 
matrix is a useful mathematical representation of such a 
concept. We shall use these two terms interchangeably. 


maximally entangled state, and T denotes matrix trans¬ 
position with respect to the chosen orthonormal basis 
of Some useful properties of the CJ iso¬ 
morphism are given in Appendix Ai. A map is com¬ 
pletely positive if and only if its CJ representation is 
positive semidefinite, while the trace-preserving condi¬ 
tion is equivalent to (where de¬ 

notes the partial trace over Aq, and is the identity 
matrix in Aj). An instrument is therefore equivalently 
represented as a set 

M^^o>0, tr,4^ £ 

® a=l 

(2) 


A. Local operations 


B. Process matrices 


Each party A acts in a local quantum laboratory, which 
can be identified by an input Hilbert space and 
an output Hilbert space xhe dimensions d/,^ and 

d^^ of input and output spaces do not have to be equal, 
as ancillary systems can be added or discarded during 
an operation; we shall nevertheless assume throughout 
the paper that all Hilbert spaces are finite-dimensional. 
According to quantum theory, the most general local 
operation is described by a completely positive (CP), 
trace non-increasing map : Aj —> Aq [15], where 
we write Aj, respectively Aq, for the space of hermi- 
tian linear operators over the Hilbert space resp. 

Examples of CP maps are deterministic opera¬ 
tions, such as unitaries or quantum channels, or (gen¬ 
eralized) measurements. In general, a label a, denot¬ 
ing the measurement outcome, is associated with the 
CP map The choice of operation (e.g. of mea¬ 

surement setting) is represented by an instrument [16], 
which is defined as the collection J'^ = {■M.^Ya=i 
of CP maps associated to all measurement outcomes, 
characterized by the property that Y^=i is CP and 
trace-preserving (CPTP). An instrument generalizes the 
notion of POVM (positive operator-valued measure) to 
include the transformations applied to the system; it 
reduces to a POVM for 1 -dimensional output spaces. 
When the choice of operation is described by a classical 
variable x, we will express such a dependence explicitly 

A convenient representation of CP maps is given by 
the Choi-Jamiolkowski (CJ) isomorphism [17]. Por a CP 
map : A/ —> Aq, its corresponding CJ matrix is 
defined here as 


M, 






e At 


iO/ 


( 1 ) 


where I is the identity map, | 1 L)) = := 

O 1;)'^' e O is a (non-normalized) 


As discussed in Ref. [13], requiring that quantum me¬ 
chanics holds locally implies that the probability that 
the N parties A' observe the outcomes a^,.. .,fljv, for a 
choice of operations xi,..., x^, is a multilinear function 

corresponding CP maps 

• • ■' f-J representation, it was 

shown that these probabilities can then be expressed as 




= tr 




(3) 


for some hermitian operator W G A| Aq 0... 0 Aj^ 0 
Aq called a process matrix, which describes the general 
quantum resource connecting the local laboratories. 

The set of valid process matrices is defined by requir¬ 
ing that probabilities are well-defined - that is, they 
must be non-negative and must sum up to 1 - for all 
possible operations, including operations that involve, 
in each laboratory, local interactions with ancillary sys¬ 
tems that may be entangled with the other laboratories. 
As we show in Appendix B, these conditions are equiv¬ 
alent to 


W > 0 , (4) 

tr W = dQ, (5) 

W = Lv(W), (6) 

where dQ = ... dy^N, and Ly is a projector onto the 

linear subspace Ly C AJ (g) Aq (g) ... (g) A^ (g) Aq de¬ 
fined in Appendix B. We will denote the closed convex 
cone of non-normalized processes defined by (4) and (6) 
by W. 

In the case of two parties A (Alice) and B (Bob), see 
Pigure 2, these conditions on W G Aj (g) Aq ^ Bj ^ Bq 
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reduce to 


case, is described by process matrices W satisfying 


W> 0 , 

(7) 

tr W = do, 

( 8 ) 

BiBo^ = AoBiBqV^ , 

( 9 ) 

AiAfj^P^ AiAqBq^P^ / 

(10) 

= BoW + noW-noBoW, 

(11) 


where (here and throughout the paper) the operator x' 
denotes the CPTP map consisting in tracing out the sub¬ 
system X and replacing it by the normalized identity 
operator, formally defined as 


W = 

aNanW = 4N-1,N„nW, 

42,12 = 414242 4N4NW. 

These conditions, first found in [14, 18], mean fhat 
party A' can only signal to party Ai if i < j. The 
process is therefore compatible with the causal order 
~< ~< A^. When this is the case, we write 

as a mnemonic 


xlV = (g) trx W. 

ax 


Ad\ 





A 


W 


B 







= _ (16) 

Process matrices of this form (and the obvious per¬ 
mutations) are called causally ordered. As shown in 
Refs. [14, 18], fhey correspond fo standard (causally 
ordered) quantum circuits and can be implemented as 
quantum channels with memory between the parties. 


2. Bipartite causally separable processes 


Figure 2. Representation of a bipartite process matrix W, con¬ 
necting Alice's (Aq) and Bob's (Bq) output systems to their 
input systems (Aj and Bj). 


1. Non-signalling and i-way-signalling process matrices 

Two important particular cases of process matrices 
may shed light on the above definition. The first case 
is when the process matrix does not allow for any sig¬ 
nalling, and the second one is when it allows for sig¬ 
nalling only in one fixed direction between the parties. 
They are discussed in more details in Appendix A 2. 

The first case is described by process matrices W sat¬ 
isfying 


W = Al...Af^ = ® (13) 

where is a densify mafrix representing an ordi¬ 

nary quantum state. In this case, the probability rule (3) 
reduces to the standard Born rule 


P(M, 


A^ 

flibi' 


■ ,M, 


4N 

“nI^n 


) =tr 


V «il 


til 






p ^ 

(14) 


where := tr4i are POVM elements. 

ai\xi Aq ai\xi 


The second case, of which fhe firsf one is a particular 


According to Eq. (15), a bipartite causally ordered 
process matrix compatible with the order A ~< 

B satisfies and 5,60^^^® = 

AoBiBo^^^^■ Nofe fhat the latter relation corresponds 
to Eq. (9) above, and is therefore automafically satis¬ 
fied if W'^^® £ Cv- Thus, a given matrix W'^^® £ 
Ai (g) Aq g) B; (g) Bq is a valid causally ordered process 
matrix compatible with the order A -< B if and only if 

W'^^® > 0, frW'^^® =do. 


W'^^® £ Lv and W'^^® = . (17) 

The analogous condition holds for the order B -< A. 

Note that a non-signalling process matrix W must be 
compatible with both orders A ~< B and B -< A. It 
must therefore satisfy W = BqW = or equiv- 

alenfly W = AqBq^I we indeed recover the form of 
Eq. (13). 

Following Ref. [13], we say that a bipartite process 
matrix W is causally separable if if can be decomposed 
as a convex combination of causally ordered processes, 
i.e., if if is of fhe form 


-h (18) 

with 0 < ^ < 1 . Ignoring the normalization constraint, 
the set of causally separable process matrices is a con¬ 
vex cone, which we denote by W®®P. A process matrix 
that cannot be decomposed as in (18) is called causally 
nonseparable. 
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3- Tripartite causally separable processes 

In this paper we will define tripartite causal sepa¬ 
rability only for processes where the output space of 
the third party C (Charlie) is trivial, i.e., dco ~ ^ 
Figure 3). As C cannot signal to the other parties, ev¬ 
ery process of this kind if compatible with C being last. 
Thus, only two causal orders are relevant in this case: 
A ^ B ~< C and B ~< A ~< C. The conditions for pro¬ 
cess matrices being compatible with these orders are, 
according to equation (15), 

^ (20) 

(21) 

and 

Y^B<A<C ^ (22) 

(23) 

•\ajB<A<C _ \\jB<A<C 

AiAqCiCq^ BoAjAqCiCo^ ■ ( 24 ) 

Since these three conditions together define a linear 
subspace, we can write them more succinctly as 

^ (25) 

y^B<A<C ^ (26) 

where Ly^^g^c ^od Lg^y^^c ^re the projectors onto the 
aforementioned subspaces. 



Figure 3. Representation of a tripartite process matrix W 
where one party has trivial output = 1 . It can be seen 
as connecting Alice's (Aq) and Bob's (Bq) output systems to 
Alice, Bob and Charlie's input systems Aj, Bi and C/. 

Therefore, when C's output space is trivial, we will 
call a tripartite process matrix causally separable 
if it is of the form 

(27) 

with 0 < < 1 . Ignoring the normalization constraint, 

this defines a convex cone We will use this def¬ 

inition in Section V to show that a recently introduced 


tripartite quantum resource, which yields information¬ 
processing advantages with respect to causally ordered 
processes [7, 8], is causally nonseparable. 

The generalization of the notion of causal separability 
to a larger number of parties, with arbitrary dimensions 
of the output spaces, is not trivial. The reason is that 
one can consider situations in which an agent, through 
her local operations, could modify a classical variable 
that determines the causal order of agents in her future. 
In such a "classical switch", operations would still be 
causally ordered in each rim of an experiment, but it 
wouldn't be possible to write the corresponding process 
matrix as a mixture of causally ordered ones. As this 
issue does not affect the cases treated here, we shall not 
consider it further. A more detailed analysis will be 
presented in an upcoming work [19]. 

III. CAUSAL WITNESSES 
A. Definition and characterization 

In this section we develop mathematical tools to iden¬ 
tify, in the bipartite case, which process matrices are 
causally separable and which are not. In analogy with 
entanglement witnesses [20], we call a hermitian oper¬ 
ator S a causal witness (or witness, simply) if^ 

tr[S W'^'^P] > 0 (28) 

for every causally separable process matrix W®®P. This 
definition is motivated by the separating h5rperplane 
theorem [21]: since the set of causally separable pro¬ 
cesses is closed and convex, for every causally nonsep¬ 
arable process matrix Wns there exists a causal witness 
Swns such that trjSw^^Wns] < 0. 

To construct a witness for a given nonseparable pro¬ 
cess, we will start by characterizing the set of all causal 
witnesses in terms of linear constraints on a convex 
cone. This will allow us to cast the problem of find¬ 
ing a witness as an SDP problem. First, note that (28) is 
equivalent to 

tr[SW'^^®]>0 (29a) 

tr[SW®^^]>0 (29b) 

Let us focus on condition (29a). Using Eq. (17) and not¬ 
ing that for any valid process matrix W, g^]N is a valid 
causally ordered process matrix compatible with the or¬ 
der A ~< B, one finds that (29a) is equivalent to 

tr[S(BoW )]>0 VWeUy, W> 0 . (30) 


' Note that the bound 0 and the sign of the inequality are arbitrary; 
we choose them as in Eq. (28) for mathematical convenience. 
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Thinking of the trace as the Hilbert-Schmidt inner prod¬ 
uct and noting that the map is self-dual, we have 
that 

tr[S(BoW)] =tr[(BoS)lV], (31) 

and it is sufficient that bqS > 0 for the right-hand-side 
to be non-negative for all valid W. An analogous argu¬ 
ment shows that > 0 is sufficient to satisfy condi¬ 
tion (29b). We conclude that for S to be a causal witness, 
it is sufficient that 

BqS > 0 and ,4^5 > 0. (32) 

Note also that adding an operator belonging to 
the orthogonal complement Cy of Cy to any witness 
S gives another valid witness, since tr[(S -|- S^)W] = 
tr[SW] for any valid process matrix W. It turns out that 
this suffices to completely characterize the set of causal 
witnesses, as stated in the following theorem: 

Theorem 1. A hermitian operator S G A; (g) Aq (g) B; ig) Bq 
is a causal witness if and only if S can be written as 

S = Sp + S^, (33) 

where Sp and are hermitian operators such that 

BoSp> 0 , AqSp > 0, Lv(S-'-) = 0. (34) 

The rather technical proof of this theorem is relegated 
to Appendix C. This theorem provides a characteriza¬ 
tion of the closed convex cone of causal witnesses S. 

Since does not change the expectation value 
tr[SW], it can freely be chosen to be for instance 

= Lv{Sp)-Sp, (35) 

so that S = Lv{Sp). This has the effect of restricting 
witnesses to the subspace of valid processes Ly, which 
have the following characterization: 

Corollary 2. A hermitian operator S G Cy is a causal wit¬ 
ness if and only if there exists a hermitian operator Sp G 
Aj (g) Aq such that S = Ly{Sp), g^Sp > 0 , and 

Ao^P ^ 0- 

This restricted set of causal witnesses is also a closed 
convex cone, which we denote by Sy = SOCy. 

One could define witnesses as belonging to Sy in¬ 
stead of S, since both sets are as powerful in detecting 
causal nonseparability. However, some physically moti¬ 
vated witnesses, such as those presented in Section V B 
(for the tripartite case), do not belong to Sy, which is 
why we use the more general definition that witnesses 
belong to S. 


B. Finding causal witnesses 

The previous characterization of the convex cone of 
causal witnesses allows one to efficiently check the 
causal nonseparability of any process matrix W through 
algorithms for semidefinite programming (SDP) [22]. 
They output a causal witness if W is causally non- 
separable, and an explicit decomposition in terms of 
causally ordered process matrices otherwise. 

The idea is simply to minimize tr[S W] over the cone 
of causal witnesses^ Sy, and check whether we obtain 
a negative value or not. Note that in order to make 
tr[S W] lower boimded (to avoid getting a value —00 
for causally nonseparable process matrices) a normali¬ 
sation constraint on the witnesses has to be imposed. 
This normalisation is arbitrary - any constraint that 
makes Sy compact suffices - and different normalisa¬ 
tion choices give rise to different interpretations for the 
value of tr [SW]. We shall normalise the witnesses by 
imposing that tr[S O] < 1 for every (normalised) pro¬ 
cess matrix O, for — tr[S W] can then be interpreted as a 
measure of causal nonseparability, as we shall see later 
in this subsection. In order to be able to use it in the 
SDP problem we still need to write this normalisation 
as a conic constraint. To do so, we extend the constraint 
tr[S n] < 1 to non-normalised process matrices by lin¬ 
earity: 

tr[S n] < tr[n]/do, (36) 

which is equivalent to 

tr[(l/do-S)n] >0 (37) 

for all n G W. Recalling that S is assumed to be in 
Sy C Cy, this means that 1 /do ~ S G Wf := W* D Cy, 
where W* is the dual cone of W - that is, the cone of 
hermitian operators that have non-negative trace with 
process matrices. 

To test the causal nonseparability of a given process 
matrix W, we are thus led to define the following SDP 
problem: 

min tr [SW] 

s.t. SeSy, 1 /do-S e Wf, 

which is written explicitly in terms of positive semidef¬ 
inite constraints in Appendix D. 

If the solution of the SDP problem (38) leads to a neg¬ 
ative expectation value of S, one can conclude that W 
is causally nonseparable, since SDP algorithms can be 


^ In principle minimizing over S instead of Sy would lead to the 
same value for tr[S W], but this causes technical problems as ex¬ 
plained in Appendix E. 
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guaranteed^ to find the optimal solution [22]. In such 
a case, the optimal solution S* provides an explicit wit¬ 
ness to verify the causal nonseparability of W. On the 
other hand, if tr[S* W] = 0 , one concludes that W is 
causally separable, and an explicit decomposition of 
W into causally ordered processes is given by the SDP 
problem dual to (38) (this can be seen explicitly from 
the representation of the SDP problem (39) given in Ap¬ 
pendix D). As shown in Appendix E, this dual is 

mintr[n]/do 

S.t. w -F n e new, 


where W®®P is the cone of non-normalized causally sep¬ 
arable process matrices, as previously defined. Further¬ 
more, the optimal value tr[n*]/iio of problem (39) is 
related to the optimal value tr[S*W] of problem (38) 
through 

tr[n*]/do =-tr[S*W]. (40) 

This gives an operational meaning to — tr[S*W]. As 
shown in Appendix E, this quantity corresponds to the 
minimal A > 0 such that 

j^(w + An) (4,) 

is causally separable, optimized over all valid, nor¬ 
malised processes O. In other words, it quantifies the 
resistance of W to the worst-case noise. This is an 
analogue of the measure of entanglement called gener¬ 
alised robustness, which quantifies the resistance of the 
entanglement of a quantum state to worst-case noise 
[23]. It turns out that for our case the interpretation 
of — tr[S*IV] as a measure of causal nonseparability is also 
tenable, as it respects some simple axioms that we pro¬ 
pose in Appendix F. For this reason, we define the gen¬ 
eralised robustness of a process W as 


i^g(W) = -tr(S*W). (42) 


Again in analogy with the case of entanglement mea¬ 
sures, one can also define the random robustness [24] of 
W as is its resistance to “white noise", which can be de¬ 
fined as the process that sends maximally mixed states 
to each laboratory, independently of the local opera¬ 
tions: 


1 ° := 


1 


(43) 


The optimal witness with respect to random robustness 
can be formd by solving an SDP problem analogous 


3 When the assumptions of the Duality Theorem (8) are satisfied, 
which is the case for our SDP problems, as proven in Appendix E. 


to (38): 


mintr(SW) 

s.t. S E Sv, tr(Sl°)<l, 


(44) 


whose dual is 


min A 

s.t. A > 0 , W -h Al° e W'P, 

and random robustness itself is defined as 

Rr{W) = -tr{S*W), (46) 

where tr(S*IV) is now the optimal value of the prob¬ 
lem (44). This quantity can be used to compare wit¬ 
nesses in scenarios where white noise is an appropri¬ 
ate noise model, however, it cannot be interpreted as 
a proper measure of causal nonseparability, as it does 
not respect all the axioms we propose in appendix F - 
more specifically, it is not monotonous rmder local op¬ 
erations. 

A geometrical interpretation of the results of this sec¬ 
tion is shown in Figure 4. 


C. Implementing causal witnesses 

Once a causal witness S has been obtained for a 
given causally nonseparable process matrix W, a nat¬ 
ural question is how to "measure" it, i.e., how to access 
the quantity tr[S W] - and, in particular, check its sign 
- experimentally. 

To do so, note that as S G A/ O Aq O Bj O Bq is a 
hermitian operator, it can always be decomposed as a 
linear combination of the form^ 

s= E (47) 

x,y,a,h 

where 'yx,y,a,h ^re real coefficients and and 

B B 

° are positive semidefinite matrices that can be 


+ In the decomposition (47), x, y, a and h should a priori simply be un¬ 
derstood as labels for and . We can however assume, 

without loss of generality, that (Eo ^ and 

Bo (Ei) ) < l®(®o/dBo lor all x,y (we can indeed always 

include scaling factors in the coefficients 'rx,y,a,b)- Introducing, 
when required, some complementary positive semidefinite oper¬ 
ators and (with null coefficients jy,, „ 1,), so that now 

and Bo(&M®(;°) = 1^^'^o/dg^^ 
the sets and can then be interpreted as the 

Cl representation of instruments, for which x, y are inputs and a, b 
are outputs. 
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Figure 4. Here we schematically represent the set of nor¬ 
malised process matrices in W by the red ellipse and the set of 
normalised causally separable processes in by the blue 
ellipse. Since the latter set is closed and convex, any causally 
nonseparable process W is separated from it by a hyperplane, 
corresponding to an operator S which we call a causal witness. 
In the figure we represent two such causal witnesses, and 
Sr^, that represent two different ways to quantify how far W is 
from being causally separable. — tr(SRg W) measures the gen¬ 
eralised robustness of W, which is its resistance to the worst- 
case noise Q. Geometrically, the generalised robustness of W 
is given by the ratio of distances d{W, WS) /d(W8, Q), where 
WS is the causally separable process closest to W on the de¬ 
picted line. In its turn, — tr(SRj.W), the random robustness of 
W, is its resistance to the "white noise" 1 °. Geometrically, it 
is given by analogous ratio d(W, W) /d(W'', 1“), where W is 
again the causally separable process closest to W on the de¬ 
picted line. Sr^ and Sr^, are the optimal solutions of fhe SDP 
problems (38) and (44), respectively. 


interpreted as the Choi-Jamiolkowski representation of 
CP trace non-increasing maps (see Section II). 
Expanding tr[S W], 


or its trace with any valid process. Hence, it actually 
suffices to find a decomposition for S + for some 
arbifrary S^, implement the corresponding maps, and 
combine their statistics as above. 


D. Example 


Let us now illustrate the above considerations on an 
explicit example. Ref. [13] introduced the following 
process matrix, for a case where all incoming and out¬ 
going systems of A and B are 2-dimensional (qubif) sys¬ 
tems (i.e., dA, = dAo = = dso = 2): 


I^OCB — ^ 


7! . 


(49) 

where Z and X are the Pauli matrices, and tensor prod¬ 
ucts are implicit. One can easily check that Wocb ^ 0 , 
that tr[WocB] = 4 = dp, and that WocB satisfies 
Eqs. (9)-(ii), which ensures thaf if is indeed a valid 
process matrix. It was shown that Wocb allows for a 
violation of a causal inequalify (see Section VI), which 
implies that it is causally nonseparable. 


The concept of causal wifnesses introduced here al¬ 
lows us to prove the causal nonseparability of Wocb 
more directly. Solving the SDP problem (44) with 
YALMIP [25] and the solver MOSEK [26], we obtained, 
up to numerical precision, the optimal witness with re¬ 
spect to random robustness 


SoCB — 


1 - 
4 . 


1 - (1L^JZ^0Z®H®0 + Z^H^ox®JZ®o) 


(50) 

Applying it to Wocb/ we find fhat — tr[SocB ^ocb] = 
Rr(bVocB) = \/2 — 1 > 0 (where Rr(kVocB) is the 
random robustness as defined in Equation (46)). This 
proves that Wocb is causally nonseparable. 

This also implies that the process matrices of fhe form 


fr[SW]= ^ W], (48) 

x,y,a,h 


Wocb (A) = (Wocb + A1°), (51) 


where according fo fhe generalized Born rule (3), the 
terms tr ® represent the probabili¬ 
ties P that the maps and 

are realized. We assume that these CP maps can be im¬ 
plemented even if fhe causal order of the parties is not 
well-defined. The quantity tr[S W] can thus in princi¬ 
ple be implemented experimentally by estimating the 
probabilities and combining them as 

in Eq. (48). 

The decomposition (47) is not unique. Eurthermore, 
as noted before we can add to any witness S a term 
such that Ly(S^) = 0 without changing its validity 


are causally nonseparable for 

A<Rr(WocB) = V2-I (52) 

(and fheir causal nonseparabilify is then witnessed by 
Socb)- For A > \/2 — 1 / Wocb(A) is causally separa¬ 
ble; the solution of fhe SDP problem (45) provides an 
explicif decomposition for WocB(f^r(WocB)) (as can be 
seen when wrifing (45) in a form similar to Eq. (D6)), 
from which we can derive an explicif decomposition for 
all Wocb (A) for A > V2 - 1, as 

Wocb(A) = iw^ct (A) + (53) 
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where 




1 

4 

1 

4 


n 

1 + — i^iz^oz^it^o 

1 + A 

1 + z^‘t^oX^‘Z^o 
1 + A 


( 54 ) 

( 55 ) 


are causally ordered process matrices. (Note that for 
A < -v /2 — 1 , Wq(^^(A) and W’q^^(A) as defined above 
would not be positive semidefinite, which explains why 
Eq. (53) then fails to provide a valid causally separable 
decomposition of Wocb(A).) 

To measure the witness Sqcb arid obtain the quan¬ 
tity tr[SocB ■ IV] experimentally, one can for instance 
decompose it in the following way: define, for 
X, y, y', a,b = 0 , 1 , the CJ matrices 


M" 


b\y,y=o 


b\y,y'=l 


( 56 ) 

-2- ) ® (-2- ) ' ( 57 ) 


^ ^ , (58) 


which represent measure-and-prepare maps (see Ap¬ 
pendix A 1). One can then check that 

SocB = 3 ■ 1° — 4 Gqcb (59) 


with 


Gqcb = n 


x,y,a,h 


5 a V M", 

a\x 


+ 5 b r m: 


’(8)M. 


bly,y '=0 


^(g)M, 


B/Bo 

b\y,y'=l 


(60) 


where is the Kronecker delta. Thus, one can com¬ 
pute tr[SocB ■ IV] by performing the maps above on W 
and combining the probabilities P= 

■ W] as follows: 

tr[SocB ■ W] = 3 - 4 tr[GocB ■ W] 


x,y,a,h 


+%.fK+«K=.) -( 61 ) 


= 3 - 4 . i E 

B,Br 


M ■ 


As one may recognize, the choice of CP maps in (56)- 
(58) is the same 5 as that considered in Ref. [13], so 


5 Note that compared to Ref. [13], we exchanged in the present paper 
the notations x, y and a, b for inputs and outputs, so as to use here 
the same notations as most of the recent works on quantum and 
nonlocal correlations [27]. Furthermore, in [13] the state sent out 
by B when y' ~ i was arbitrary, while here we fixed it to be /2. 


that the experimental procedure proposed here to mea¬ 
sure the witness Sqcb would be the same as that 
suggested in [13] to violate a causal inequality. The 
labels x,y,y',a,b can be considered as inputs and 
outputs for the above maps (which indeed satisfy 

and = 

for all x,y,y'). As it turns out, in the causal 
inequality of Ref. [13] the probabilities P[a,b\x,y,y') = 
are actually combined in precisely 
the same way as above - namely, tr[GocB ' IV] above 
can be identified with the probability psucc of wirming 
the corresponding “causal game", 

psucc = ^[P{a = yjy' = 0 ) + P(l 7 = xjy' = 1 )], (62) 

when the inputs x,y,y' = 0,1 are given with equal 
probabilities. 

Remarkably, in this particular case the bounds of 
the causal witness Sqcb ^rid of the causal inequal¬ 
ity (62) coincide, i.e., tr[SocB ' ^] ^ 0 if and only if 
Psucc = h[GocB ■ fV] < 3 / 4 , where 3/4 is the upper 
boimd on psucc for any causal correlation (as defined in 
Section VI below). Furthermore, the noise threshold be¬ 
low which the noisy process matrix IVocb(A) (53) can 
violate the causal inequality is the same as the thresh¬ 
old Rr(lVocB) below which IVocb(A) is causally non- 
separable, as already noted in Ref. [28]. This is however 
not a general property of causal witnesses and causal 
inequalities: similarly to the case of entanglement vs. 
quantum nonlocality and of entanglement witnesses vs. 
Bell inequalities [27], there exist causally nonseparable 
process matrices that carmot yield any violation of any 
causal inequality - while there always exists a causal 
witness that detects their causal nonseparability. We 
will come back to this issue in Section VI below, with 
an explicit example in the tripartite case. 


IV. QUANTUM CONTROL OF CAUSAL ORDER 
A. The quantum switch 

It has recently been suggested that quantum compu¬ 
tation can be extended beyond the framework of quan¬ 
tum circuits, which enforces a fixed order between the 
execution of quantum gates. The main idea is that the 
order in which gates are performed can be coherently 
controlled by a quantum system. The new resource that 
allows for such a control is the quantum switch, first 
proposed in Ref. [6]. It works as follows: consider a 
two-qubit system, composed of a control and of a target 
qubit. Two parties A and B act on the target qubit with 
the imitaries U^, Ub respectively. If the control qubit 
is prepared in the state |0), Ua is applied to the target 
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before Ug, while if the control is in state | 1 ) the two 
unitaries are applied in the reversed order. The global 
unitary, acting on both the target and control qubits, is 
thus 

V(Ua, Ub) = |0)(0| 0 UbUa + | 1 )( 1 | «> UaUb, (63) 

where the first factor in each tensor product acts on 
the control system and the second factor acts on the 
target. For an initial state IV’) of control- 

target system, one gets, after applying V, the state 
-^(|0) ig) UbUaI*/’) + |1) ® which can be in¬ 

terpreted as having applied the two unitaries on the 
target in a "superposition of orders"^. 

Nofe fhat if fhe confrol system is discarded, one is 
left with the mixed state 

^(UgUAly’XflUA^B + llAUBmmM)- (64) 

This can be produced by randomly exchanging the or¬ 
der in which Ua and Ug are applied and thus can be 
seen as an equal mixture of causally ordered processes. 
To make the situation more interesting, we shall be led 
to introduce a third party, C, who can perform mea¬ 
surements on the control qubit (and possibly also on 
the target qubit) in order to define a causally nonsepa- 
rable process (using fhe definition (27)) using quantum 
control of causal order. 

B. Process matrix representation of the quantum switch 

For our purposes, we can formally represent the 
quantum switch (with fixed inpuf state) as a tripartite 
process matrix: the two parties A and B perform an 
arbifrary CP map each on fhe farget qubif, while C per¬ 
forms an arbitrary two-qubit POVM measurement on 
the resulting control-target state (with no outgoing sys¬ 
tem). The dimensions of input and output systems of 
the local laboratories are therefore 

= dgQ = 2 , dci = 4 , dcQ = 1 . 

(65) 

For clarity, we shall divide C's input space as C/ = 
C) (8 Cj, where C) and C\ refer to the control and target 
qubits, respectively (with therefore d^c = d^^t = 2). 

In order to describe the process matrix of fhe quan¬ 
tum switch, we are first going to make use of fhe "pure" 


^ Since any CP map can be purified to a unitary evolution by intro¬ 
ducing an ancillary system and a projective measurement on some 
subsystem of the original system and ancilla, the notion of super¬ 
position of orders can be easily extended from unitary operations 
to arbitrary CP maps by introducing an ancillary register for each 
party. 


version of fhe formalism, described in Appendix A 2. 
An identify charmel from a party's output space Aq 
to another party's input space Bj is described, as a 
process matrix, by the projector onto the "process vec¬ 
tor" ■ Tbe situation where 

A receives a state \ip), performs an arbifrary opera¬ 
tion on if, and sends the output directly to B through 
an identity channel, who in turn sends the output of 
his operation to Cj, is represented by the process vec¬ 
tor see Appendix A 2. Then the 

quantum switch, with the control qubit initially in the 
state and the target qubit in the state \^), is rep¬ 

resented by the process matrix \ w){w\, where 

+ ( 66 ) 

This can be checked by noting that 

{{U*A\^‘^°{{U*Bf^’^° ■ |ZP) = 

0 {UBUA\^)f' + 0 {UAUB\iP)f‘), 

(67) 

where := 1 0 bfj^|l)) is the "pure" CJ repre¬ 
sentation of Ua, and similarly for (and with 

((If* I = I U *))^); see Appendix A1. Note that the result¬ 
ing state is the same as that obtained by applying (63) 
to an initial state 0 11/’). 

Note that the process (66) itself is clearly causally 
nonseparable^, since i) if is a superposifion of a pure 
process only compatible with the order A -< B -< C 
and a pure process only compatible with the order 
B ~< A -< C and ii) it is a projector onto a pure vec¬ 
tor, thus it cannot be written as a nontrivial mixture of 
causally ordered processes. 

From Eq. (66), one finds (using the facts that 

trc.(|l))((l|''°''^) = 1^0 and trc<(|l))((l|^°''b = l^o) 
that by tracing out C, one gets 

trQ |w)(w| = frqct |zf)(zf| = (68) 

where 

W'^^® = 0 |l))((l|'^o®^ 0 l®o, (69) 

W®^'^ = 0 0 (70) 


z Note that the results of Ref. [6] only show that the quantum switch 
cannot be realized by a circuit with a fixed order of gates, but the 
more general notion of causal (non)separability was not considered. 







11 


are (bipartite) causally ordered process matrices; 
trQ |h;)(k;| indeed describes the situation of Eq. (64). 

For some information-processing tasks, the quantum 
switch is known to provide an advantage over causally 
ordered processes [7, 8], even when C ignores the tar¬ 
get system and only measures the control system. We 
will thus restrict our attention to witnesses of the form 
(?) which can simplify the analysis and the ex¬ 
perimental implementation. The reduced process we 
will be dealing with is the partial trace of the quantum 
switch (66) over the target system: 

^switch = (71) 

Note that the proof of causal nonseparability based on 
the purity of the switch does not extend to the reduced 
switch (71), since it is not an extremal process. We 
will therefore use the framework of causal witnesses to 
show that the reduced switch is also causally nonsepa- 
rable. 


where y '■= H with the dual of the 
cone yV^c of (non-normalized) tripartite process matri¬ 
ces with d-CQ = 1. 

Analogously to problems (38)-(39), the dual of (76) 
writes 

mintr[n]/do 

SPD (77) 

S.t. w n e n e W3C, 

and the optimal values of (76) and (77) respect the dual¬ 
ity relation (40), which allows us to interpret — tr(S* W) 
as generalised robustness also in this case. Further¬ 
more, (76) and (77) respect the assumptions of the Du¬ 
ality Theorem, and therefore SDP algorithms can find 
their optimal solutions efficiently. We shall, however, 
omit the proofs, as they are simply a slight modifica¬ 
tion of the ones already presented in Appendix E. 

A. Optimal witness 


V. WITNESSES FOR THE QUANTUM SWITCH 

Since the quantum switch is a tripartite process 
where dco ~ definition (27) to study its 

causal (non)separability. In this tripartite situation, we 
will define causal witnesses to be the hermitian opera¬ 
tors S such that 


tr[S > 0 (72) 


for every causally separable processes W®®P in the cone 
Wg^^. The set of causal witnesses is thus the cone dual 
to Wg^^, which we denote by S3C, or when re¬ 

stricted to £y. The characterization of S3C is given by 
the following theorem: 

Theorem 3. A hermitian operator S € A; (g) Aq (g) 67 (g) 
Bq (g) C/ (g) Co mith dc^ = 1 is a causal witness if and only 
if S can be written as 


where 


S — S^BC + SabC — ^BAC + ^BAC' (73) 

Sabc — 0 / ^aaBac{Sabc) — 0 ' ( 74 ) 

Sbac — 0' ^b^a^c{Sbac) ~ (75) 


luith La^b^c Lb^a^c defined in Subsection 

The proof is given in Appendix G. This characteriza¬ 
tion allows us to cast the problem of finding a witness 
for the quantum switch (or in fact for any process W 
with dco = 1) problem analogous to (38): 


mintr(SW) 

s.t. S e S3cy, t/dQ — S €z Wgo^y , 


(76) 


To find the optimal generalised robustness witness 
for the quantum switch we need to solve SDP prob¬ 
lem (76) providing Wg^vit^h from Eq. (71) as an argu¬ 
ment. Solving it using YALMIP and the solver MOSEK 
we obtain a witness Soptimal numerically; the gener¬ 
alised robustness of the quantum switch is found to be 


^g( ^switch) fr Soptimal ^switch ~ 0 . 5454 . ( 7 ^) 

Later in this section we will compare this number to 
that obtained from non-optimal witnesses. For this pur¬ 
pose, we shall use the amount of worst-case noise tol¬ 
erated by a witness, i.e., the amount of worst-case noise 
that can be added to the quantum switch before the wit¬ 
ness can no longer detect its causal nonseparability. It 
should be clear that, when the said witness is optimal, 
this number reduces to the generalised robustness of 
the quantum switch. 


B. Chiribella's witness 

In Ref. [7] Chiribella proposed an information¬ 
processing task for which the quantum switch had an 
advantage over causally ordered processes. We want to 
understand what this advantage means, and how it re¬ 
lates to causal nonseparability. For that we shall present 
a slightly modified version of his task and show how it 
can be understood as a causal witness. 

Our version of the task is as follows: Alice (party A) 
receives a qubit in her lab, applies a unitary Ua to it, 
and sends it away. Bob (party B) receives a qubit in his 
lab, applies a unitary Dg to it, and sends it away. We 
assume that in each run of the experiment, Ua and Llg 
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either commute or anticommute. Charlie (party C) re¬ 
ceives a qubit in his lab, and makes a measurement on 
it to decide whether Uj\ and Ug commute or anticom¬ 
mute. 

To construct a causal witness in relation to this task, 
we start with the Choi-Jamiolkowski representation of 
the actions of the parties: Alice applying a unitary Ua, 
Bob applying a unitary Lfg, and Charlie obtaining the 

result ± when measuring in the |±) = basis. 

Using the CJ representations lU^)) and |Ug)) of 11 ^ and 
Ub (see Appendix A i), the corresponding operator is 

gU^,Us ^ I I ^ II ^ I ±)(± I. (79) 


will use Ua = UXy+ and Ug = VZV^, where V is 
also a Haar-random unitary (and X and Z are the Pauli 
matrices)^. 

With these measures Gchiribella turns out to be a valid 
causal witness, as the maximal probability of success 
for causally separable processes Psucc is bounded below 
one. To calculafe it we need to solve the following SDP 
problem: 

max tr [Gchiribella tv] 

seD (83) 

s.t. frW = do. tVeWg^. 

Solving if with YALMIP and MOSEK, we obtain 


The witness corresponding to the task is obtained by 
averaging over the cases where Charlie obtains -)- when 
Alice and Bob apply commuting unitaries, and the 
cases where Charlie obtains — when Alice and Bob ap¬ 
ply anticommuting unitaries: 

Gchiribella ~ 2 J 2 j , 

(8o) 

where dp [ j is a measure over commuting unitaries, and 
dp I j is a measure over anticommuting unitaries (we 
assume here that the cases where Ua and Ug commute 
and anticommute each appear with probability j). The 
probability of success in this task when the parties are 
using a strategy described by a process matrix W is then 

Psucc = h [Gchiribella^] • (8l) 

It is easy to check that for any choice of measures 
dpj j,dp| j fhe probability of success is 1 when W = 
Wgwitch- The maximal probability of success for causally 
separable processes, however, depends crucially on the 
measures dp[ ] and dp| p If we were to choose, for ex¬ 
ample, measures that only produce pairs of Pauli ma¬ 
trices, then there is a causally separable circuit® that can 
decide the commutativity or anticommutativity with 
probability 1. 

To avoid this problem we will first choose measures 
that can produce any pair of commuting or anticom¬ 
muting imitaries (modulo global phases). Specifically, 
we choose fhe commuting measure dp[ ] to pick up 
commuting imitaries of fhe form 

Ua = u(^^ and Gg = (82) 

where U is uniformly disfributed according to the Haar 
measure, and 0, are uniformly distributed in the inter¬ 
val [0,2/1]. For the anticommuting measure dp( j, we 


® One such circuit, described in Ref. [7], involves applying the Pauli 
unitaries to one half of a maximally entangled state and doing a 
measurement in the Bell basis. 


p^^Pc« 0 . 9288 . (84) 

The amount of worsf-case noise fhat Gchiribella can fol- 
erate is 0 . 0766 , which is much worse than the 0.5454 
tolerated by Soptimal- 

An issue wifh Gchiribella is that it would take an in¬ 
finite number of measuremenfs to estimate each term 
of fhe sum in (80). Furfhermore dp[ ] and dp^ j were 
chosen arbifrarily, while it would be preferable to have a 
justification for the choice of a particular measure. Both 
problems are solved by restricting the unitaries Ua and 
Ub to come from a finite set. In this way we only need 
perform a finite number of measurements to estimate 
the witness, and it is possible to optimize the measures 
over commuting and anticommuting unitaries through 
SDP problems. 

The best witness we found is obtained by choosing 
the following fen unitaries: 


G 


{ 1 ,X,Y,Z 


X + Y X-Y 


x-hz x-z Y-hz y-z., 


(85) 


(Y being the third Pauli matrix), and defining the wit¬ 
ness to be 


^ [,] . 

Gfinite — 2-1 ^ij G+ 

U=1 




( 86 ) 


where Uf^ G G, and are the input probability 

distributions over commuting and anticommuting uni¬ 
taries, normalised such that = 1- 


5 It turns out that with this choice of measures the witness Gchiribella 
is the same as we would obtain by translating the task from Ref. [7] 
directly into the language of causal witnesses; the only difference, 
then, is that in [7] the witness was decomposed in terms of mea¬ 
surements and repreparations, whereas we decomposed it using 
unitaries only. 
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To obtain the weights ^ and pl^cc we solved an 

SDP problem presented in Appendix H. We obtained 

0.8690, (87) 

and tolerance to worst-case noise 0.1507, which is 
higher than Gchiribella's 0.0766, but still lower than 
^optimal S 0.5454. 

We want to emphasize that the witnesses obtained 
in this subsection are equivalent to the ones defined 
through (72) in the beginning of fhe present section 
- the only difference being fhe arbifary choice of fhe 
causal bound being > 0 vs < Psucc- More precisely, lef 
G be a witness such that 


tr(GW^^P) < p^^P, (88) 


for every (normalised) causally separable W®^P and 

To<tr(GW)<ri (89) 


for every (normalised) process mafrix W. Then 


S = 




(90) 


is a valid generalised robustness witness. Furthermore, 
if S is fhe optimal wifness for some process matrix W 
that saturates the upper bound tr(G W) = Tj, it follows 
fhat 

rp sep 

Rg(W) = -tr[SW]= Lp^' 7 - (91) 

Psucc ^ 0 


When G is eifher Ggnite or Gchiribella/ we have fhat 
Tq = 0 and Tj = 1. And even though they are not 
0|)timal witnesses for Wg^itj-j^, fhe relationship befween 
Psucc sod resisfance to worst-case noise is valid for 
fhem, i.e., for bofh Gf^ite and Gchiribella the resistance to 
worst-case noise is equal to 1 / psucc “ 1/ as given by (91). 


VI. CAUSAL INEQUALITIES 


The notion of causal separability considered above 
relies on the quantum description of fhe local laborafo- 
ries. One may ask what are the constraints imposed by 
a definite causal structure regardless of the specific de¬ 
scription, or even the physics governing the devices per¬ 
forming fhe local operations. To study such restrictions, 
we will make use of so-called causal inequalities [13], 
which bound the possible correlations that can be estab¬ 
lished between events following a definife causal order. 
The violation of a causal inequalify gives a sfronger, 
device-independent signature of lack of causal order 
than the measurement of a wifness. It is natural to ask 
whether it is possible to use the quantum switch to vio¬ 
late a causal inequality; we show below that this is not 
the case. 


A. Device-independent causal relations 

We still consider a multipartite scenario in which a set 
of N parties are located in different, separated 

laboratories. Each party can perform operations and 
obtain measurement outcomes. Contrary to the previ¬ 
ous case however, we do not consider here any partic¬ 
ular physical description of what happens in each lab; 
the "settings" for fhe operations in the different labo¬ 
ratories and the measurement outcomes are labelled by 
some classical variables x, and a, (with 1 < i < N), re¬ 
spectively; for simplicify we assume thaf fhe x,'s and 
A,'s fake a finife number of values. Defining fhe vector 
of seftings x = (xi,... x^) and the vector of outcomes 
a = (aj, ..., An), the device-independent description of 
the correlations established in such an experiment is en¬ 
coded in the conditional probability P(a|x). 

Causal inequalities [13] are constraints on P(a|x) de¬ 
rived from fhe assumpfion thaf fhere exisfs an under¬ 
lying causal strucfure defining fhe order befween par¬ 
ties. To be more precise, let us represent the causal 
order in which the parties act by a permutation cr, de¬ 
fined such fhat party i acts before party j if and only if 
cr(z) < cr{j). This leads fo a total ordering of fhe parties, 
namely -< We fhen say thaf 

a probabilify distribution P(a|x) is compatible with the 
causal order <7 if no parfy signals to those before her^°, 
namely if for every i the marginal distribution 

P(a^{i),...,a^(,)|x) := (92) 

i>i 

does not depend on the inputs Xp.(y) with j > i; i.e., 

~ ■ • ■ > ^cr(i)' ' ' '' ^cr(N)) 

(93) 

A probability distribution that is compatible with at 
least one causal order a is said to be causally ordered. 

More generally, we allow the parties to share ran¬ 
domness to agree on a specific order of sending signals 
befween them before fhe inpufs of fhe game are given fo 
fhem. This allows for convex combinations of causally 
ordered probabilify disfribufions: 

P(a|x) =Y^qa P(ria\x), qa> 0 , = 1 , (94) 

cr cr 

where each Pa is compatible with a fixed order a. These 
are still not the most general correlations compatible 


Note that this condition is strictly stronger than no-signalling to 
each individual party, since it is possible to signal to a group of 
parties without signalling to any individual party. 





with the assumption of a definite causal structure, as 
one party could control the causal order of a set of par¬ 
ties in its future [19, 29, 30]. Correlations compatible 
with this most general scenario of definite causal or¬ 
der are called simply causal. In the bipartite case, the 
set of causal correlations forms a convex polytope, de¬ 
limited by a finite number of facets that define causal 
inequalities [31]. The explicit definition of causal corre¬ 
lations in the general N-partite case is, however, rather 
cumbersome, and for the purposes of this article it will 
be enough to consider probability distributions of the 
form (94), which is a sufficient (although not necessary) 
condition for causal separability. 

As causally separable processes can only generate 
causal correlations, the violation of a causal inequality 
can also be used to detect the causal nonseparability of 
a process. While causal witnesses are device-dependent 
and can only detect causal nonseparability if each party 
trusts her operation's implementation, causal inequali¬ 
ties are completely device-independent: even if each party 
distrusts her laboratory, they can still detect causal non¬ 
separability from the statistics of their experimental 
outcomes, if those violate a causal inequality. While 
for every causally nonseparable process there is causal 
witness that will detect its nonseparability, there are 
causally nonseparable processes cannot be used to vi¬ 
olate any causal inequalities: in the next subsection 
we will prove that the quantum switch provides such 
an example. There is an analogy here with entangle¬ 
ment witnesses, which allow for a device-dependent 
way of detecting entanglement, and Bell inequalities, 
which provide a device-independent entanglement cer¬ 
tification - "nonlocality" [27]. The important differ¬ 
ence is that states violating Bell inequalities are physi¬ 
cally implementable, while no example of a physically 
implementable process violating causal inequalities is 
known. 


B. Quantum control of orders and causal inequalities 

One might first wonder if the quantum switch allows 
for a causal inequality violation between A and B (such 
as the bipartite causal inequalities of Refs. [13, 31]); this 
is however clearly not the case since, as pointed out be¬ 
fore, ignoring (i.e., tracing out) the third party C makes 
the process matrix of the quantum switch causally sep¬ 
arable. 

One might still hope that the quantum switch can be 
used to violate a tripartite inequality (see e.g. [30]), ex¬ 
plicitly involving party C; as it turns out, this is also im¬ 
possible, as a consequence of the following theorem^^: 


Theorem 4. Consider N-\-l parties {A^,, A^,C} with 
settings {xi,..., x^, z} and outcomes {«!,... c}. If the 

marginal distribution 

P(fl|x,z) := ^P(fl,c|x,z) (95) 

C 


is such that 

1. P(fl|x,z) = P{d\x) - i.e., it does not depend on z: C does 
not signal to any other (group of) parties; 

2. P{d\x) = qa-Pa{a\x), where qa > 0 , = 1 , and 

the probability distributions Pa are causally ordered, 

then the full -partite probability distribution 

P{d,c\x,z) is causal. 

Proof. Using Bayes' rule and the assumptions of the the¬ 
orem, we can write 


P(fl, c|x,z) = P(fl|x,z) P(c|fl, x,z) (96) 

= qaP(T{a\x) p{c\d,x,z) (97) 

O' 

= J^qaPa{a,c\x,z), (98) 

O' 


where Pa{d,c\x,z) := Pir{d\x) P{c\d,x,z) is compatible 
with the order C; this shows 

that P(fl,c|x,z) is causal. □ 

To see that the correlations generated by the quantum 
switch (Eq. (66)) respect assumptions 1. and 2. of the 
previous theorem, let us calculate the marginal proba¬ 
bility distribution defined in Eq. (95) through the gen¬ 
eralized Born rule (3), when the three parties perform 
operations and : 

r a\x b\y c\z 


P{a,b\x,y,z) = ^ tr 


(g) (g) Nff ■ |a;)(H;| 


b\y 


c\z 


= tr 


o (g 0 (E^c?z) ■ • (99) 


Since the third party C has no output space (dco = 1 )/ 
then for any instrument we have Yjc = 

so that 


P{a, h\x,y,z) = tr 


a\x b\y 


(100) 


with 


y^AB ._ |w)(k;| . (101) 

This implies that P{a,b\x,y,z) does not depend on z, 
as required. As argued before, tracing out C from 


A similar conclusion based on the same example has been obtained 


by Oreshkov and Giarmatzi independently of the other authors of 
this paper and is presented in Ref. [19] 
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the process matrix representing the quantum switch 
leads to a causally separable process matrix of the form 

with causally ordered pro¬ 
cess matrices W^^® and W®^"^, which can only gen¬ 
erate causally ordered probability distributions Pa^b 
and Pb^a- Hence, P{a,b\x,y,z) can be decomposed as 
jPAAB{‘i,b\x,y,z) + 2-Pb^a(«/ b\x,y,z), so that the sec¬ 
ond assumption of Theorem 4 is also satisfied. 

Therefore, the quantum switch represents an exam¬ 
ple of a causally nonseparable process that can only 
generate causal correlations, and hence cannot be used 
to violate any causal inequality^^. It is noteworthy that 
all the examples of causally nonseparable processes for 
which a physical interpretation is known, including 
those generated by space-time superpositions [32], fall 
into this category. This raises the question of whether 
causally nonseparable processes that do violate causal 
inequalities can be physically implemented at all. 

VII. CONCLUSION 

The process matrix formalism was originally con¬ 
ceived as a rather speculative extension of quanfum me¬ 
chanics to possibly include the indefinite causal struc¬ 
tures expected in a quantized theory of gravity [10]. 
The results of this work show that, in fact, it is a natu¬ 
ral framework to study a class of quantum resources 
which cannot be captured by the circuit model, but 
nonetheless are physically realizable and can provide 
powerful compufational advantages. We have shown 
that the quantum switch, a recently demonstrated re¬ 
source for quantum computation, can be conveniently 
represented as a causally non-separable process matrix. 
We have also presented causal witnesses that can verify 
the causal nonseparability of the switch. As they only 
require performing imitaries in a “superposition of or¬ 
der" and a final measurement of a control qubit, such 
witnesses can be easily implemented in quantum-optics 
setups, as the one employed in Ref. [9]. 

The theory of causal wifnesses developed here has 
close resemblances with the theory of entanglement 
witnesses. In both cases, one is interested in finding 
ways to certify thaf a resource is outside some convex 
set, the set of separable states in the latter case, that 
of causally nonseparable process matrices in the for¬ 
mer case. Following this analogy, causal inequalities 
can be seen as the counterpart to the Bell inequalities, 
as they both provide device-independent tests regard¬ 
ing the existence of some classical variable: local hid¬ 
den variables for measurement outcomes in one case. 


Note that Theorem 4 implies that this is also true for the N-partite 
generalization of the quantum switch defined in [8]. 


classical variables determining the causal order in the 
other. A significant difference between the two frame¬ 
works is thaf the problem of determining causal sepa¬ 
rability can be solved numerically with efficient algo¬ 
rithms, whereas characterizing entanglement has been 
proven to be an NP-hard problem [33]. 

As one could expect from the analogy with entangle¬ 
ment, there exist causally nonseparable processes that 
cannot violate causal inequalities. What is striking, in 
the case of process matrices, is that a physical interpre¬ 
tation is known only for resources in this category. As 
one of the main open problems in this field is the char¬ 
acterization of physical process matrices, it is tempt¬ 
ing to speculate whether the (im)possibility to violate 
causal inequalities could provide a useful guidance in 
this respect. 
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Appendix A: Details of the formalism 

Here we explore in more details the properties of the 
Choi-Jamiolkowski (CJ) isomorphism and of the pro¬ 
cess matrix formalism. Note that other existing def¬ 
initions of the CJ isomorphism differ by a transposi¬ 
tion or a partial transposition from the one given here, 
which follows the convention in [13] and allows a direct 
identification of non-signaling processes with quantum 
states. 


1. Choi-Jamiolkowski isomorphism 

a. Pure C} isomorphism. It is convenient to distin¬ 
guish two versions of the CJ isomorphism: one for 
maps over density matrices and one for linear opera- 
fors on pure state. The latter - the "pure CJ isomor¬ 
phism" - can be represented via the "double-ket" nota¬ 
tion [34, 35]. For a linear operator A : -A we 
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define ^3 

(Ai) 

where |1)) = := Ej\i)^‘ ® I//' e 

(with also, of course, the usual notation ((1| = |1))^), 
and the complex conjugation * is defined with respect 
to the chosen orthonormal basis {|/)^^} of 'H^^. The 
inverse map is given by 

A\ip) = [{ip\^> (E) 1^0 . (A2) 

We say that |A*)) is the CJ representation (or CJ vector) 
of A. The cumbersome complex conjugation in the defi¬ 
nition allows us to have a simpler representation for the 
process matrix. 

b. Maximally entangled states and unitaries. Con¬ 

sider here the case where the input and output spaces 
have equal dimensions, d^^ = d/^^. The state obtained 
by applying a local unitary to one subsystem of a max¬ 
imally entangled state is also maximally entangled, in 
reverse, it is possible to generate any (bipartite) maxi¬ 
mally entangled state by applying a local unitary to one 
subsystem of a reference maximally entangled state. 
Therefore, the CJ vector = 1 lf*|l)) zs max¬ 

imally entangled if and only ifU is a unitary. More explic¬ 
itly, an operator 

u = E^ik\m (A3) 

jk 

is unitary if and only if = ^j,k for 

all j, k. One can check fhat this is also a necessary and 
sufficient condition for which 

= (A 4 ) 

jk 

is maximally entangled. 

c. Measurement-preparation. Another useful linear 
operator is \ip){(p\, which describes the observation of 
an outcome |(j)) in a projective measurement, followed 
by the repreparation of a state \ip). Plugging this into 
the definition (Ai), we find the CJ representation 

= (A5) 

Reciprocally, every pure product CJ vector represents a 
measurement-preparation operation. 

An important particular case is when \ip) = \(p), 
which corresponds to the ideal non-demolition von 
Neumann measurement: 

= (A6) 


“3 Superscripts on CJ vectors and CJ matrices indicate the systems 
they refer to (they may be omitted when the context makes it clear 
enough). 


d. Mixed C] operators. For the general case of a lin¬ 
ear map A 4 ^ : Ai —t Aq, we define the CJ isomorphism 
as 

M^’^o := [IEM^{\1)){{1\)]'^ . (Ay) 

It is easy to verify fhat the definition (Ay) reduces 
to (Ai) for operators of fhe form A 4 ^{p) = ApA^, i.e. 
that, in such a case, 

M=|A*))((A*| (A8) 

(with I a*)) = \ A*))^'^° and ((A*| = |A*))^). 

According to Choi's theorem [36], a linear map : 

Aj —t Aq is CP if and only if its CJ matrix is posi¬ 
tive semidefinite, M^^^o > 0 . A characterization of the 
trace-preserving condition can be found using the in¬ 
verse CJ isomorphism, 

M^ip)=[ tvA, [p^‘ ® ^. (Ag) 

By taking the trace of both sides of the equation, it can 
be readily verified fhat the map Ad is trace-preserving 
if and only if 

fr,4pMA^o = 1^1. (Aio) 

Nofe thaf a CP map can be parf of an instrumenf only 

if if is trace-non-increasing, a condition thaf translafes fo 

- tr^o > 0. (All) 

A useful example is fhe CPTP map Airier) = ptra, 
which corresponds fo the preparation of a (normalized) 
stale p independently of fhe input state a. Its CJ repre¬ 
sentation is found fo be 

jy^AiAo ^ ^ (^pT^Ao _ (^12) 

A second relevant case is the CP (not trace-preserving) 
map that gives the probability of observing a POVM 
element E in a measurement: Ad^(p) = tr[Ep] (here 
= 1 ). Its CJ representation is simply 

M^' = E^’. (A13) 

Finally, the situation where a POVM element E is mea¬ 
sured on the state cr in Aj and a state p is prepared 
in Aq corresponds to the CP map A 4 ^ (cr) = ptrjEu], 
which has CJ representation 

jy^AiAo ^ pA, ^ ^ 

2 . Process matrices 

ITere we discuss in more detail some examples and 
properties of process mafrices. 
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a. Quantum states. Consider a bipartite process 
matrix of the form 


y^AiAoBiBo ^ pA[Bi ^ ^AqBo_ (A15) 


According to the generalized Born rule, Eq. (3), the 
probability for the two parties A and B to perform trace 
non-increasing CP maps with CJ matrices and 

respectively, is given by 




= tr 


{E^‘ (A16) 


where E^i := and := 

These operators are positive semidefinite and, because 
of Eq. (All), they can be completed to form a POVM. 
Thus Eq. (A16) corresponds to the probability of ob¬ 
serving the POVM element E^^ O E®' given the state 
pAiBj. other words, the process matrix (A15) de¬ 
scribes a bipartite state. Notice that a process matrix 
of this form does not allow signalling in either direc¬ 
tion and therefore, being compatible with both A -< B 
and B -< A, it is causally separable. This is irrespec¬ 
tive of the state p, which can be entangled or separa¬ 
ble. Note also the difference between the process ma¬ 
trix (A15) and the CJ representation of state prepara¬ 
tion, Eq. (A12). 

b. Channels. Consider a bipartite situation where a 
party A only performs state preparations, while the sec¬ 
ond party B only performs measurements. In this case, 
the local laboratory of A is characterized by a trivial in¬ 
put space, = 1 , while B has a trivial output space, 
dgp = 1 . The process matrix shared by A and B, which 
represents here a quantum channel, is then defined on 
the space Aq <E) Bi 3 W. The probability that B observes 
a POVM element E when A prepares a state p is given 
by 


P(E|p)=tr 




(A17) 


where we used (A12) and (A13) for the local op¬ 
erations. This is equivalent to saying that B mea¬ 
sures E in the state tr,^^ ® = 


tv/^^ (g) pBj . . Comparing this with 

the inverse CJ transformation (A9), we find that the pro¬ 
cess matrix W corresponds to a channel with CJ repre¬ 
sentation In other words, a channel C from Aq to 
B/ is represented by the process matrix 


n T 


W^oBi =X0C(|1))((1|) 


(A18) 


(with here | 1 )) = Note that the CJ represen¬ 

tation of a channel, Eq. (A7), differs by a transposition 
from the corresponding process matrix (A18). 


c. Reduced process matrices. Given a multipartite 
process W = and a CPTP map for the j- 

th party with CJ matrix we define the reduced 

process matrix for the remaining N — 1 parties, given 

as 


IV(M^J^o) 


■= A’A’ 


(1 








(A19) 


With the usual generalized Born rule (3), the reduced 
process matrix gives the probability for the remaining 
N — 1 parties to measure arbitrary CP maps, given that 

the ;-th party performs The explicit depen¬ 
dence of W on accounts for the possibility of sig¬ 

nalling: the remaining parties observe different proba¬ 
bility distributions depending on the choice of CPTP 
map performed by party j. As an example, consider a 
process matrix of the form (A18). If A prepares a state 
p, the reduced process matrix for B is 








YJ^oBi 


E(fc|/|/)C(l7)(fc|)=C(p). 

jk 


(A20) 


Thus, for a process that represents a channel from A to 
B, the reduced process for B, given that A prepares p, is 
simply the channel applied to p, as should be expected. 

d. Pure process matrices. In some cases, the process 
matrix turns out to be a rank-one projector: W = |ic)(iw| 
for some "process vector" |a;). If the CJ operators rep¬ 
resenting the local operations are also rank-one pro¬ 
jectors, as is the case for unitaries and projective mea¬ 
surements followed by pure repreparations, it is conve¬ 
nient to work at the level of vectors and of probability 
amplitudes: given the local operations Ai ,..., rep¬ 
resented by the CJ vectors \A\))^^^o,... ,\Ay))^^ ^ 

the overall probability amplitude is given (up to global 
phase, which we choose to be 0) by 


(g) . . . (g) (A21) 


The probability is then obtained as the modulus square 
of the amplitude and conforms to the general expres¬ 
sion (3). Given that party j performs the unitary Uj, the 
reduced process is clearly given by the partial scalar 
product 

(g)... {{U*\^’‘^o (g)... 

(A22) 

The process matrix describing a unitary channel U 
from Aq to B; is of particular interest. Using (A18), we 
find that it is given by 


ZvfoBi ^ J ^ ^ (^23) 
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Note again the difference between this expression and 
the CJ representation (Ai). Generalizing this to a se¬ 
quence of parties Aj,..., with the output of party j 
connected to the input of party j + 1 via the unitary Uj, 
we find 

I =\Ui)) o ^ 0 • • • 0 |!Jn)) ° '• (A 24 ) 


Appendix B: Valid process matrices 

The conditions for an operator W € A; 0 Aq 0 
Bp (g) Bq to be a valid process matrix were first found 
in Ref. [13], where they were formulated in a basis- 
dependent way. Here we derive the equivalent charac¬ 
terization of valid process matrices given in Eqs. (4)-(6); 
we formulate it in a basis-independent way, which we 
find to be more convenient for our purposes. 

We present below the derivation in the bipartite case, 
and also write explicitly, for ease of reference, the char¬ 
acterization in the tripartite case. The N-partite case 
follows from a straightforward generalization. 


Requiring that its value is non-negative 
for all > g implies that W must be positive 

semidefinite. 

Reciprocally, W > 0 clearly implies that (Bi) is satis¬ 
fied. Hence, the non-negativity of probabilities is equiv¬ 
alent to W being positive semidefinite, Eq. (4). 

Normalization. The fact that probabilities must sum 
up to 1 for all instruments is equivalent to the constraint 
that the probability of realization of any CPTP map is 
1 . Now, recall that a CP map : Ap —t Aq is trace¬ 
preserving if and only if its CJ matrix satisfies 

tr,4Q = t^i 

- or equivalently, using the nota¬ 
tion of Eq. (12), Ignoring here 

for simplicity the possible use of an ancillary system 
(which leads to the same conclusion's)^ the normaliza¬ 
tion of probabilities is thus equivalent to 


tr 


(g) 

V > 0, > 0, 


= 1 


s.t. . 

(B2) 


I. Bipartite process matrices 


Recall that a given operator W £ A/ (g) Aq (g) B; (g) Bq 
is a valid process matrix if and only if it yields, through 
the generalized Born rule (3), only well-defined proba¬ 
bilities - that is, the probabilities must be non-negative 
and must sum up to 1. 

Non-negativity. As recalled previously, a map is 
completely positive if and only if its CJ representation 
is positive semidefinite. Including the possibility that 
A and B's operations involve interactions with a (pos¬ 
sibly entangled) ancillary system in a state the 

non-negativity of probabilities is thus equivalent to’^"'^ 


tr 


(^j^A\AiAo ^ . ^pA'jB'j ^ > q 

V > 0 , > 0, > 0 . (Bi) 


Eor the case where the ancillary spaces A'j and 
B^ are isomorphic to Ap 0 Aq, and and 

p^i^i are both projectors onto the maximally entangled 
state \t))^i^o/AiAo _ ^ (^^ere 

{|;, is an orthonormal basis of 0 

we find that the trace in (Bi) is equal to tr[M®i®'®o . 


Note that ignoring the possibility of an ancillary system, one would 
only find that W must be "positive on pure tensors" (with respect 
to the partition AjAo/BpBo) - a class strictly larger than positive 
semidefinite matrices [17]. 


Eirst of all, note that the positivity of and 

j^BpBo j^g irrelevant here, since the set of positive 
semidefinite operators is a full dimensional subset of 
the space of hermitian operators^^. The only relevant 
conditions are the normalization constraints. Defining 
the maps ]^i_Aq\NI = M — Aq^ [i-Bq]'^ = M — 
BqM, and noting that for any hermitian operators x and 
y, the operators [p-Aq]^ + and [i-Bq]]/ + I/'^Bq 


^5 Taking into account a possible ancillary state the same rea¬ 

soning as below leads to Eqs. (B4) and (B15), with W replaced by 
pA'iSj (g| yjAiAqBiBo (and where the definitions of the maps and 
Lg should include the ancillary systems Bj and Aj, resp.), which 
must hold for all pNK such that trpN^i = 1. One can easily check 
that these are indeed equivalent to (Bq) and (B15) in their original 
form. 

More precisely: for any hermitian operator , there always 

exists £t > 0 such that A /cIaq > 0 . Assuming 

that one can thus decompose as 

PAAiAo (^,+1) with + 

> 0 and > 0 satisfying 

/‘^Aq- Similarly, any hermitian 
operator such that can be decom¬ 
posed as = (/ 3 -I- 1 ) with > 0 , 

1^b,Bo ^ g Note that 

the four pairs satisfy the assumptions of Eq. (B2), 

and therefore tr|^(M(^'^° 8 = 1. Expanding 

tr[(M^i''o 8 using the decomposition just 

constructed, we find that its value is also 1 . Hence, if Eq. (B2) holds, 
then it also holds without the positivity constraints on and 

P/pBiBo ■ the converse is of course trivially true. 
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satisfy the above normalization constraints (where from 
now on we are omitting the superscripts to reduce clut¬ 
tering), we find that Eq. (B2) is equivalent to 


tr 


([l-Ao]^ + ® ([l-Bojy + ^ 

'i X, y. 


= 1 
(B3) 


For X = y = Q, this yields the normalization condition 
of Eq. (5), 


tr[W] = dAodBo • (B4) 

For y = 0 and x = 0 , respectively, this in turn implies 


tr 

tr 


W =0 Vx, 

(1 [i_Bo]y) W =0 Vy, 


which then imply 


tr 


[( 


V X, y. 


W 


= 0 


(B5) 

(B6) 


(By) 


Reciprocally, Eqs. (B4)-(B7) clearly imply (B3), so that 
these are equivalent to Eq. (B2). 

Thinking of the trace as the Hilbert-Schmidt inner 
product 

(M, W) = tr[M ■ W] 

(for hermitian operators M, W) and noting that the 
maps [i-AoY [i-Bo]' self-dual, the condi¬ 

tions (B5)-(B7) are equivalent to 

[i-^o] (= 0 / (B8) 

[l_Bo](trAjylo W) = 0 , (Bq) 

[1-Ao][l-Bo]^ = 0 , (Bio) 

which we can rewrite as 


BiBo^ — AqBiBq^ I (Bll) 

AiAqV^ = AiAqBo^ / (B12) 

W = BoW + ,4oW-AoBoW, (B13) 

which are conditions (9)-(ii). 

Note that each condition (B8)-(Bio) defines a linear 
subspace, and the intersection of these three linear sub¬ 
spaces is the smallest subspace that contains all valid 
bipartite process matrices, which we denote by^^ 

Cy = {W G Aj(E)Aq(E)Bi(E)Bq I W = Ly(W)}, (B14) 


“7 Note that although we do not write that explicitly, the projectors 
we define below (e.g. Ly), and of course the subspaces they define 
(e.g. Cy), depend on the number of parties N. 


The projector onto this subspace, Ly, shall be used quite 
often in the paper, so it is useful to find an explicit ex¬ 
pression for it. To do that, first we rewrite conditions 
(Bii)-(Bi 3) explicitly as projections onto subspaces, i.e., 
as 


W = L,4(W), W = Lb{W), W = L,4b(W), (B15) 

where the projectors L^, Lb, and Lab given by 

LAiW) = W - b,BoW + AoB,BoW, (B16) 

Lb{W) = W - a,AoW + a,AoBoV^, (B17) 

iAB(W) =BoW + AoW-AoBoW, (B18) 

Since the three projectors above commute, the projec¬ 
tor onto the intersection of their subspaces Ly is given 
simply by the composition of La, Lb, and Lab, i e-, 

LyiW) = LA0LB0LAB{W), (B19) 

which, after simplification, can be written as 

Lv{W) = AoW + BoW - AoBo^ 

- BiBo^ + AqBiBo^ 

- AiAqV^ + AiAqBo^ (B20) 

Summing up, we conclude that an operator W G 
Ai® Aq® Bi ig) Bq is a valid bipartite process matrix 
if and only if W > 0, tr W = dAQdBp, and W = Ly(W), 
as in Eqs (4)-(6). 


2. Tripartite process matrices 

A similar reasoning leads to the conclusion that an 
operator W G A/ (g) Aq g) B/ g) Bq g Q g) Cq is a valid 
tripartite process matrix if and only if W > 0 , tr W = 
dAodBodco' ^rid 

W = L,4(W), W = Lb(W), W = Lc(W), 

W = Lab{W), W = Lac{W), W = Lbc{W), 

W = LABdW), (B21) 

where the maps La, Lb, Lq, Lab, Lac, Lbc, and Labc 
are now commuting projectors onto linear subspaces of 
A; g Ao g B; g Bq g C7 g Co, defined by 

La{VV) = [i-^i-Ao)BiBoCiCo]^ ' 

LBiW) = [l-(l-Bo)AjAoCjCo]^' 

Lc{V^) = [l~{l-Co)A,AoBiBo]^' 

Lab{W) = [ 1 _( 1 -Ao)( 1 -Bo)CiCo]^' 

Lac{'^) = [l-{l-yio)(l-Co)BiBo]^' 

hBc(W) = [i_(i_Bo)( 1 -Co)A,Ao]^' 

LaBc{W) = [l-{l-yio){l-Bo)(l-Co)]^' 
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where we used the shorthand notation 

[Ex ‘>‘xX] = Yj‘>^x- X W (B22) 

X 

for a sum over products of subsysfems X wifh coeffi¬ 
cients ctx (and with {\N := W). 

The constraints in (B21) are equivalent to 

W = Lv{W), (B23) 

where the map Ly is obtained here by composing the 7 
maps La, Lb, Lab, J-ac, ^bc, and Labc- One finds 
in this tripartite case, after simplification, 

i'V(fV) - ^^_(^^_a^+a,Ao){1-Bo+BiBo){1-Co+C,Co) ^ 

+ n/AoBjBoQCo] ' 

(B24) 

which defines a projector onto the linear subspace 

Cv = {W G Ai^Aq^Bj^Bq^Cj^Co I w = Ly(W)} . 

(B25) 


3. N-partite process matrices 


The generalization to the N-partite case is rather 
straightforward. We find fhat an operafor W G A| (g) 
Aq 0 ... (g) A^ (g) Aq is a valid N-partife process mafrix 
if and only if W > 0, fr W = and for all 

2 ^ — 1 non-empfy subsefs X of { 1 ,..., N}, 


W = La'(W) 


TliiX AiAQ ^ 


Note that the 2 ^ — 1 maps Lx are commuting projec¬ 
tors onto linear subspaces of A| 0 Aq 0 ... (g) A^ (g) Aq . 
The consfrainfs (B26) are equivalent to 

W = Ly(W), (B27) 


where the map Ly is obtained this time by composing 
the 2 ^ — 1 maps Lx- More explicitly, one finds in the 
N-partite case the general expression 


Ly(W) 


[1 - Tli('^-^o+X^o') + ri/njnjj] 


iW, (B28) 


which again defines a projecfor onto the linear subspace 

= {W G a}® aJ,® ...(g) aJ^OA^ I W = Ly(W)}. 

(B29) 


Proof. For any subset X of { 1 ,...,N}, define Px = 
n!eA'(l “ ^b) ^7^0' ^ there 

exists (at least one) /q such that the product PxPx' con¬ 
tains the factor (1 - A^jA'/’A^- Now, = B, 

so that [P^P^,]fV = 0. 


Developing Ly, we thus find Ly{W) = 
^ [1-Ea-40^’a']^ = [i-LxPx+niXX^]^- 
Now, one can write Ex Px = EI^=o---EI^=o{'^- 
Al)^^{A]Aiy-^K..{l - Ag)^N(AfAg)i-'^N 
n,E^,=o(l - Abj'^-XAjA^i-'^. = n,(l - Ab + AjA^), 

from which Eq. (B28) follows. □ 


Appendix C: Characterization of bipartite causal witnesses 

To characterize below the set of causal wifnesses in 
the bipartite case, we shall make use of some basic def¬ 
initions and facts from convex analysis, which we state 
here without any proof; the interested reader can find 
them for instance in Sections 2,14 and 16 of Ref. [21]^®. 

Let E be a vector space equipped with an irmer prod¬ 
uct {■,■), and let E' be the space of all linear functionals 
on E. In the finite-dimensional case that interests us, E' 
is isomorphic to E. 

1. A subset /C of E is a convex cone if and only if for 
every x, y G /C we also have that Ax -j- yy G 1 C, for 
any A, y > 0 . 

2. Let X C E be a convex cone. Then its dual cone 
1 C* C E' is defined as 

1 C* = {x* G E' I (x*,x) >0 Vx G /C} . (Cl) 

3. The dual of a linear subspace £ C E is its orthog¬ 
onal complement: 

£* = £-L = G E'I (x*,x) =0 Vxg£}. (C2) 

4. Let /Ci,/C2 C E be closed convex cones that con¬ 
tain the origin. Then 

[conv(/Ci U IC2)]* =IClr\IC*2, (C3a) 

(/Cl n IC2)* = conv(/Ci* U IC*2), (C3b) 

where conv denotes the convex hull. 

We shall furthermore use below the following char¬ 
acterization of bipartite causally separable process ma¬ 
trices: 

Lemma 5. A given matrix W G A; ® Aq ^ Bi^ Bq is a 
valid causally separable process matrix if and only if Pc VC = 
do, W G Cy (i.e., it satisfies Eqs. (9)-(ii)), and it can he 
written as 

W = W^^® -h W®^^ (C4) 

with W^^®> 0 , W^^® = BoW^^®, (C5) 

W®^^ > 0, W®^^ = Ao^^^^ ■ (C6) 


Note a difference in language. This reference uses the polar cone 
K° instead of the dual K*. They are simply related by = —K*. 
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Proof. The "only if" direction is straightforward (simply 
replace q —>■ and {'i.—q) W®^"^ —> W®^"^ to 

go from (i8) to (C4), so that W^^® and W®^"^ in (C4) 
are not normalized). 

To see that the converse also holds, first note that 

y^A^B > g > g VV > Q, SO 

that W is indeed a valid process matrix. Note further¬ 
more that W®^"^ = /IqW®^"^ implies that b^BqW®^"^ = 
AqBiBo^^^^^’ i.e., that W®^"^ satisfies (9). Since W e Cy 
also satisfies (9), so does W^^® = W — W®^^. Sim¬ 
ilarly, W"^^® = ’ together with the assump¬ 

tion that W € Cy, implies that both W"^^® and W®^^ 
satisfy (10). Lastly, W"^^® = W®^^ = 

directly imply that both W^^® and W®^^ sat¬ 
isfy (11). All in all, this shows that W^^® and W®^^ 
are, up to normalization (which can easily be dealt with 
as above so as to recover the form (18)), valid causally 
ordered process matrices. □ 


We are now in a position to prove Theorem 1: 

Proof of Theorem 1. We want to characterize the set of all 
hermitian operators S ^ A]® Aq ^ Bj ^ Bq such that 

tr[SW®''P] > 0 (C7) 

for all causally separable process matrices W®^P. Note 
that the condition tr[W®®P] = do is riot relevant for 
the characterization of the witnesses, so we shall lift 
it. Without this restriction the set of (non-normalized) 
causally separable process matrices becomes a convex 
cone, which we denote by W®^P. If we consider the du¬ 
ality relations with respect to the Hilbert-Schmidt inner 
product in the space Aj 0 Aq 0 6/0 Bq, then the con¬ 
vex cone of causal witnesses S is the dual cone of W®^P. 
To characterize it we are going to use the representation 
of W^^P that follows from Lemma 5: 

W^^p =conv[(pn£Bo)u(Pn£,4o)] n£v, (C8) 


Taking then the dual of W®®P using the duality rela¬ 
tions (C2)-(C3), we get that the cone of causal witnesses 
is 


S = conv 


[(Pn£Bo)*n(Pn£,4o)*]u£^ 


(C14) 


Focusing on (P n £bq)*, using again the duality rela¬ 
tions (C2)-(C3), we see that 

{V n LboY = conv{V U £^^) 

= {S+ + So I S+ > 0 , BqSo = 0 } 

= {s I BoS > 0}, (C15) 

where the last equality is stating the fact that S = S+ -|- 
So with S+ > 0 and BqSq = 0 if and only if BqS > 0 . To 
see that this is true, let S be such that BqS > 0 . Define 
then S+ = BqS and So = S — BqS- Then S = S+ -|- So, 
S+ > 0 by assumption, and Bq^o — 0 since the map 
Bq- is a projector. To prove the other direction, let S = 
S+ -|- So with S+ > 0 and Bq^o — 0 - The map Bq' being 
positive, it follows that BqS = Bo‘S+ > 0 . 

Similarly, 


(pn£,ij* = {s I AoS> 0 }. (C16) 

Putting Eqs. (Ci4)-(Ci6) together, we see that a causal 
witness can be written as S = Sp -|- S-'- with Bq^p — 
O/Ao^p > 0, and Ly{S-^) =0. □ 


Appendix D: Explicit positive semidefinite constraints 

For the convenience of the reader, we present the SDP 
problems (38) and (39) with all conic constraints rewrit¬ 
ten in terms of the positive semidefinite cone, to facil¬ 
itate implementation. To rewrite (38), we need a char¬ 
acterisation of the cones Sy and Wf. The first one is 
given by Corollary 2. The second one is obtained as 
follows: since W = P fl £y, we have that 


where V is the self-dual cone of positive semidefinite 
matrices and £bq and Laq the linear subspaces 

£bo = {W I W = BoW}, (C9) 

£,1^ = {W I W = Ao^}> (Cio) 


W*r\£y 

(Di) 

ivn£y)*n£y 

(D2) 

conv{V U £y) n £y 

(D 3 ) 

{Ly{£p)\£p> 0 }, 

(D4) 


and with Cy defined in Eq. (B14). Their orthogonal 
complements within the subspace Ai 0 Aq 0 Bj 0 Bq 
are simply given by 


^Bo = {S 1 BoS = 0}, 

(Cii) 

0 

II 

0 

II 

0 

(C12) 

£^ = {S 1 Ly(S) = 0 }. 

(C13) 


where Equation (D4) follows from an argument analo¬ 
gous to the one used to derive Corollary 2. 

With this characterisation, the SDP problem (38) then 
becomes 

min tr (SW) 

s.t. S = Ly(Sp), AqS? > 0 , BqSp > 0 , (D5) 

\/d-Q — S = Ly(llp ), £p > 0 . 
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To rewrite the SDP problem (39), we use the characteri¬ 
sation of given in Lemma 5: 


To translate this SDP problem into the language of 
Definition 6, let us define 


mintr(n)/do 


s.t. w -E n 

= yy^xB _|_ 



yy^xB ^ j^^yyAxB^ 

(D6) 


yyBxA ^ ^^yyBxA^ 


n > 0, 

Cl = Ly{Cl) . 



Note that we could use directly the definition of 
from Section IIB 2, which would give us a slightly more 
complicated SDP problem. 

Appendix E: Duality for conic problems 


E = £vx£v, (E3) 

/C = W"®PxW, (E4) 

£ = {(n,n) IQ e £y}, (E5) 

b={W, 0 ), (E6) 

c = { 0 , 1 /do), (E7) 

and the irmer product 

((s,i:),(w,n)) =tr(sw)-Etr(i:n). (es) 

With these definitions, and denoting by x = (co. Cl) its 

variable, the primal SDP problem becomes 


In this appendix we show that the two problems de¬ 
fined in Section IIIB are SDP problems and they are 
dual to each other. We show, furthermore, that the 
Duality Theorem applies to them, which implies that 
the optimal solutions can be found efficiently and that 
Equation (40) holds. 

Let us first recall the definitions of primal and dual 
conic problems (Definition 4.2.1 in [22]), of which SDP 
problems are a particular case: 


mm ((0,1/do)/ (‘^/El)) 

\ / (E9) 

s.t. (m, Q) e (VW'^P X W) n {a;=W-EH} , 

which indeed corresponds to the SDP problem (39). 

To construct the dual SDP problem, first note that 

E' = £v X £v, (Eio) 

K.*=SvxW^, (Ell) 

£^ = {(S,-S) |S e £y} (E12) 


Definition 6. Let E he a finite-dimensional vector space, K. 
a closed convex pointed cone in E with a nonempty interior, 
and £ a linear subspace ofE. Let also b £ E and c £ E'. The 
data E, 1 C, £, b, and c define a pair of conic problems 

(P) : min(c, x) s.t. x £ IC (1 {£-\-b), 

(D) : min {y, b) s.t. y £ K.* r\ (£^ -|- c). 


where we used the property that (/Ci x IC2)* = IC^ x /C| 
in equation (Eii). Denoting by y = (S,I 1 ) its variable, 
the dual SDP problem is then 


s.t. 


min ((S,£), (W, 0 )) 

(S,E) € {SvxW^)n{£ = l/do-S}, 


(E13) 


where 1 C* C E' is the cone dual to 1 C, £^ C E is the or¬ 
thogonal complement to £, £ -\- b <£ E and £^ c <£ E' 
are affine subspaces. (P) and (D) are called, respectively, the 
primal and dual problems associated with the above data. 

We want our SDP problems to measure how much 
worst-case noise needs to be added to a given process 
matrix W to make it causally separable, i.e., the minimal 
A > 0 for which 

j^(w + An) (E.) 

is a causally separable process, optimized over all valid 
(normalised) processes O. First note that we can get 
rid of the quadratic variable A H by defining Q = A Q, 
which makes the objective A equal to ti Ci/dp. Remem¬ 
bering also that the normalisation 1/(1 -|- A) is irrele¬ 
vant for conic constraints, the problem reduces to min¬ 
imizing tr Q/do such that 

(E2) 


which corresponds to the SDP problem (38). 

Let us emphasize that here the duals of W®^P and 
W are, respectively, Sy and Wy, instead of S and 
yV*, which is a consequence of choosing the vector 
space £ to be E = F x F with F = £y instead of 
F = Aj ^ Aq ig) B/ (g) Bq. We did this because as sub¬ 
sets of Aj 0 Aq 0 B; 0 Bq, the cones W®®P and W 
(and therefore 1 C = W^^P x W) have empty interiors, 
and therefore these cones would not satisfy the require¬ 
ments of Definition 6. This is problematic because the 
duals of cones with empty interior are not pointed (in 
our case, S and W* are not pointed), and algorithms 
that solve SDP problems are numerically unstable when 
optimizing over non-pointed cones. 

This definition is indeed satisfied by the cones we 
chose, i.e., W®*^P x W C £y x £y is indeed pointed 
and has nonempty interior, as we shall check now. A 
pointed cone /C is a cone such that K. n {— 1 C) = { 0 }. 
This indeed satisfied for W®®P x W, as both cones re¬ 
quire their elements to be positive semidefinite, and 
W > 0 and — W > 0 imply that W = 0 . To show 


w -E Q e w®®P, n e W. 
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that X W has nonempty interior, it is enough^^ to 
find an operator that belongs to int . This is done 
through the following lemma: 

Lemma 7. 1 ° + O e int for any O G such that 
||n||2 < 2^, where dj = and 11-112 is the Hilbert- 

Schmidt norm. 

Proof. Since Cl £ jCy/ the discussion in section IIB 2 im¬ 
plies that the operators 

:= n - (E14) 


This concludes the proof that problems (38) and (39) 
are SDP problems dual to each other. We shall now pro¬ 
ceed to show that the Duality Theorem (Theorem 4.2.1 
in [22]) applies to them: 

Theorem 8. Let (P), (D) be a primal-dual pair of conic prob¬ 
lems as defined above, and let the pair be such that 

1. The set of primal solutions K. ft {£.-\-b) intersects int/C; 

2. The set of dual solutions K.* r\ {C^ -\-c) intersects int K.*; 

3. (c, x) is lower bounded for all x £ K. (1 b). 


are causally ordered (in the sense that they satisfy 
Eq. (17) and the analogous relation for the order B ~< A, 
respectively), and so are the operators 


yjA^B 

:= il° -\-co^^^ 

(E15) 

yyB^A 

:= il° 

(E16) 


Since, furthermore, 

y^A^B yyB^A = 1 ° + (E17) 

we have that 1° -|- O G if and W®^"^ are 

positive semidefinife. This is the case if 


CO 


A-<B 


< 


2 d, 


and 


CO 


B-<A 


< 


2 d, 


(E18) 


where || ■ || is fhe standard operator norm (i.e., the max¬ 
imum singular value). To be able to enforce fhat, first 
note that o;^^® and co^^^ are orthogonal, and therefore 
Pyfhagoras' theorem implies that 




2 

-p 

^B^A 


2 



(E19) 


which implies that 


max 


{ 


^A^B 

max 


/ 

^B<A 

} 

< 

{ 


2' 



j 


< 11E1112, 


(E20) 


and therefore 


l|n||2 < ^ (E21) 

implies fhat 1 ° -|- n G This in turn implies that 

the interior of the ball composed of operafors 1° -|- O 
with n satisfying (E21) belongs to the interior of W®®P, 
i.e., ||n||2 < 2^ implies that 1 ° -|- O G intW^^P. □ 


"9 Since int(W®'^P x W) = intW®‘=P x intW and W‘^'^P C W. 


Then both the primal and the dual problems are solvable, and 
the optimal solutions x* and y* satisfy the relation 

{c,b) = {c,x*) + {y*,b). (E22) 


Let us check that for the SDP problems (E9) and 
(E13), the three assumptions of fhe Duality Theorem 
are indeed satisfied. 

To see that 1. is satisfied, we need to find Q G int W 
such that W -|- Q G intW^^P. Take O = Al°; then 
W-hAl° G intW^'^P iff \iN + 1 ° £ intW^'^P. Using 
Lemma 7, we conclude that this is true if 




(E23) 


Since 



1 

A 


if is enough to choose 



(E24) 


A > 2 .djdQ 


(E25) 


to satisfy inequality (E23), and we're done. 

To see that 2. is satisfied, we need to exhibit a wit¬ 
ness S such that S G intiSy and 1 /dQ — S £ intWy. 
Since the cone V n £y of positive semidefinite matrices 
in £y is a full-dimensional subset of both Sy and Wf, 
it is enough to find an operator S such that S > 0 and 
1 /do — S > 0 . One can take S = 1 /( 2 do). 

To see that 3. is satisfied, nofe fhat O > 0 implies that 
trO/do > 0. 

All in all, the three assumptions of the Duality The¬ 
orem above are thus satisfied. Applying the iden¬ 
tity (E22) to our pair of conic problems, we have, for 
fhe optimal solutions O* and S*: 

0 = tr[n*]/do-htr[S*W], (E26) 

as claimed in Eq. (40). As discussed in Sec. Ill B, a value 
tr[Q*]/do = — tr[S*W] > 0 guarantees that the process 
matrix W is causally nonseparable, and the solution S* 
of the dual problem provides an explicit causal witness; 
a value tr [O*] /dQ = 0 proves that the process matrix W 
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is causally separable, and the primal problem provides 
a decomposition of W in terms of causally ordered pro¬ 
cess mafrices and (again, this is easier to 

see in the representation of fhe primal problem shown 
in (D6)). 


Appendix F: Measuring causal nonseparability 

A causal witness can he used not only to detect the 
causal nonseparability of a given process, but also to 
measure it. This is analogous to the situation with en¬ 
tanglement witnesses and entanglement measures [38]. 
First of all, we need to define what we mean by a mea¬ 
sure of causal nonseparabilify. In analogy wifh the case 
of entanglement, we suggest that a proper measure of 
causal nonseparability N should satisfy the following 
properties: 

Discrimination: A^(W) > 0 for every process mafrix 
W, with A/'(W) = 0 if and only if W is causally 
separable. 

Convexity: AAd],-p,W;) < T.iPiJ^iWi) for any process 
matrices W,- and any p,- > 0, with X]/ Pi = 1- 

Monotony: A^($(W)) < A/’(W), where $(W) is any 
process obtainable from W by composing it with 
local CPTP maps. 


to Bob's input, and the CPTP map M| applied to Bob's 
output. We can then define $(■) as the map such that 
for all processes W and all CP maps and C| we have 
thaf 


tr[(C2^ (8 C®) ■ $(W)] = tr[(Cf23 ^ C®23)W], (F3) 


where 


C 


X 

123 


j0(Af3^oC2^oAff)(|l))((l|) 


(F4) 


is the Choi-Jamiolkowski operator of fhe composifion 
of the each party's operations. The processes W and 
$(W) are illustrated in Figure 5. 


(a) 


(b) 



Figure 5. (a) The situation where the parties share a bipar¬ 
tite process W (in red) and apply the CPTP maps A 4 f and 
A 4 ^ (in blue) to their inputs and outputs can be equivalently 
described by (b) a single bipartite process $(W) (in red). 


Now we shall prove that both Kg(W) and Pr(W) as 
defined in equafions (42) and (46) respect the proper¬ 
ties of Discrimination and Convexity, whereas Rg{W) 
respects Monotony but Pr(W) does not. 

Discrimination follows from the definition of the 
SDP problems (38)-(39) and (45)-(44). Note that since 
they satisfy fhe assumptions of the Duality Theorem 
(8), there are algorithms that actually find the optimal 
solutions efficiently. 

To demonstrate Convexity, let us denote by Sw the 
optimal witness for a given process matrix W; because 
of its optimality, one has, for any process matrices W, 
and any Pi > 0 , 

tr[Sw,W^-] <tr[(SE,.p,.w,)Wy] (Fi) 


and therefore 


-tr 


i 


< 


Ep,tr[Sw;W,], 


(F2) 


that is, AfiEi pm < Ei Pi^m- 
Now we show that Monotony does hold for Rg{W). 
For fhat, first we need to define the map $(■) that com¬ 
poses a process W with local operations. More specif¬ 
ically, the map $(■) composes a process with the CPTP 
map applied to Alice's input, the CPTP map 
applied to Alice's output, the CPTP map M® applied 


It follows from this definition that $(W) is a valid 
process. To see this, note that the validity of W implies 
that the probabilities 

— ^[{^123 ® ^123)^] (F5) 

are positive and normalised. By definition, these are 
equal to the probabilities 

P{CtCl) = tr[(C2^ 0 C®) ■ $(W)], (F6) 


and the arguments in Appendix B show that requir¬ 
ing the probabilities P(C^,C|) to be positive and nor¬ 
malised is enough to imply the validity of the process 
$(W). 

Furthermore, if W is causally separable so is $(W). 
This follows from the linearity of $ (■) and from the fact 
that $ (■) preserves the causal order when applied to a 
causally ordered process, which follows directly from 
the analogous property for quantum combs [14]. 

We want to show that for all $(■) (i.e., for all CPTP 
maps M^, M^, M® and M®) and W, 

Rg($(W)) < Pg(W), (Fy) 

or equivalently that 


-tr 


■(s$(w))$(W)] <-tr[(Sw)W]. (F8) 
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By duality, this is equivalent to 


-tr 


$*(S$(W))W] < -tr[(Sw)W] 


(F 9 ) 


(where $* is the dual map of $), which follows from 
the optimality of Sw if $* is a valid causal wit¬ 

ness that respects the normalisation condition for gen¬ 
eralised robustness (as defined in SDP problem (38)). 
Therefore, we need to show it has the two following 
properties: 


tr 


$*(S$(W)) > 0 VW®"P, 

i/do-$*(s$(w)) e w*. 

The first one follows from duality 

tr =tr[s$(w)$(W^‘^P) 


(Fio) 

(Fii) 

(F12) 


and the fact that $(W®®P) is causally separable and 
S$(w) is ^ causal witness. 

The second one is equivalent to 


tr 


■(l/do-$*(s$(w))) 


n 


> 0 


(F13) 


for every (not necessarily normalised) process matrix 
n. From duality and linearity this is equivalent to 


tr 


S$(w) $(F1) 


< tr(n)/do/ 


(F14) 


and this follows from the fact that $(■) is trace¬ 
preserving and that l/do “ S${w) ^ FV* (which is the 
normalization condition from the SDF problem (38)). 

An analogous proof fails for random robustness, as 
the dual map $*(■) can increase the trace of a witness, 
and therefore make it fail to satisfy the normalisation 
condition for SDF problem (44). To show that Fr(W) 
does not in fact satisfy Monotony, it is enough to find 
a process and local operations such that Kr($(W)) > 

Rr(W). 

A concrete counterexample can be obtained by con¬ 
sidering Wqcb srid Sqcb from section III D. Let 


Wi = Wqcb ® (F15) 


be the process obtained from Wqcb by adding a maxi¬ 
mally mixed qubit to Alice's input space. Then its ran¬ 
dom robustness is (up to numerical precision) 


Rr(Wi) =-trSwi Wi = V2-I, (F16) 


where 

Swi =2 Socb® |0)(0|'^^ (F17) 


is its optimal random robustness witness. Now, we can 
obtain the process 

$(Wi) = WocB®| 0 )( 0 |'^^ (F18) 

from Wi simply by discarding the system in Alice's 
input space and replacing it with |0)(0|, which is 
clearly a local operation. Then its random robustness is 
(up to numerical precision) 


R,($(Wi)) = - tr S$(wj) $(Wi) =2(V2-1), (F19) 

where = Swj- Thus we have shown that 

Rr($(Wi)) > Rr(Wi), (F20) 

SO random robustness is not monotonous under local 
operations. 


Appendix G: Characterisation of tripartite causal witnesses 


Proof of Theorem 3. As defined in section IIB 3, the cone 
of tripartite causally separable processes with dc^ = 1 
is 


= conv [(72 n Ca^b<c) U (P n Cb^aac)] , (Gi) 

where CaaBaC C^^aaC ^re the linear subspaces 
defined by the projectors LaaBaC LbaAaC- The 
cone of causal witnesses S^c is its dual 

53c = W37* ■ (G2) 


Using duality relations (C3) and (C2) and the fact that 
the cone of positive semidefinite matrices is self-dual, 
we get 


^ 3 c — {Vn Caabac)* n (72 n Cbaaac)* 


conv( 72 u£i^B^c)j 


n 


conv 


{v 


G ^baAaC 


(G3) 

) 


(G4) 


with 

COIw(r U = 

+ ‘^ABC I ^ABC — 0/ 1-AaBac{SaBc) = 0} (G5) 

and 


conv(-pU£^^^^c) = 

{■^BAC + ^BAC I ^BAC — 0 ' ^BaAac{SbAc) ~ b} • (G6) 


□ 


















26 


Appendix H: Optimizing Chiribella's task 


We want to optimize the weights ^ so as 

to minimize the maximal probability of success^” for 
causally separable processes Psucc/ i-e-/ we want to min¬ 
imize the upper bound 

tr(GfMteW^"P) < (Hi) 

This is relevant because, according to equation (91), a 
lower Psucc corresponds to a larger resistance to worst- 
case noise. 

To do this, note that tr(GfinitefV) < Psucc if arid only if 
tr[(psuccl/^o “ Gfjnite)iV] > 0. Imposing that this holds 
for all causally separable processes W G Wg^P amounts 
to imposing that Psu?c^ “ Gfinite S where So,c 

is the cone of causal witnesses (characterized through 
Theorem 3). 

We are thus led to define the following SDP problem: 



min 

„sep 

Psucc 

s.t. 

Psu?cl/rfo 

Gfinite ^ ^3C f 



10 , , 

’ >0, 

+ 

II 




0 Vf,/ 

s.t. [Ui, Uj] ^ 0, 

4 ’ = 

0 Vz,/ 

s.t. {Ui,Uj}^0, 


where in order to keep the interpretation of the task as 
guessing whether the unitaries commute or anticom¬ 
mute, we imposed that q\'^ = 0 for non-commuting 

Ui, Uj and qjj' ^ = 0 for non-anticommuting Lf,-, Uj. 
Solving this problem numerically, we found 

« 0.8690 (H3) 

(and we omit the optimal q\j\q\j'^ for brevity). 


“ Remember that the probability of success for the quantum switch 
is always equal to one. 
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